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APPLICATIONS OF TWO OSCULATORY FORMULAS 

By John L. Roberts 

INTRODUCTION 

The main purpose of this paper is to illustrate how Mr. Jenkins’ osculatory 
formulas (A) and (B) can be applied in a convenient manner. The first section 
of this paper will be little more than a summary of some of the formulas con¬ 
tained in the other three articles. The second section will contain the appli¬ 
cations. 

I. SOME MATHEMATICS OF THE FORMULAS 

The Woolhouse notation will in this paper be used to stand for the differences, 
of u x+n which represents the given values of a function. The general formulas are 

Vt = Vo + xhyo -f \x{x - 1)5 + fa(x - l)(x - i)C; (1) 

and 

y x - u<> + xai + \x{x - 1)5 + \x(x - l)(a: - |)C. (2) 

The special formulas belonging to (2) are 

5 — b —,\d and C = Ci — fei, (A) 

where b and d are defined by b = |(fi 0 + h) and by d ~ \(da + di); and 

B = b and (7 = 0. (B) 

The special formulas belonging to (1) are 

Vo = wo + B = b, and C = 0; (C) 

and 

yo — Ua — Tjfdo i B — b gd, and C — ci gei . (D) 

Formula (C) is equivalent to Mr. Jenkins’ formula (A). Also (D) is equivalent 
to his formula (B). 


1 This paper presupposes a knowledge of three other articles. The first one by Mr. 
Wilmer A. Jenkins is entitled “Graduation Based on a Modification of Osculatory Inter¬ 
polation,” and is printed in the October 1927 issue of the Transactions of the Actuarial 
Society of America. The other two papers are mine. One of them is entitled “Some 
Practical Interpolation Formulas,” and is printed in the September 1935 issue of these 
Annals. The other one. entitled “A Family of Osculatory Formulas” is printed in the 
October 1935 issue of the Transactions. 
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II. applications of (c) and (d) 

First, there is the problem of selecting suitable examples to which (C) and (D) 
can be’applied. Secondly, we will then apply in a convenient manner the 
formulas to these examples. 

The problem of selecting suitable examples will now be considered. “The 
non-reproducing characteristic of” formula (D) “raises the question of what 
will happen in the graduation of a series whose fourth differences are all posi¬ 
tive, say. The answer is that the graduated series will lie everywhere below 
the observed points and that the observations will not be correctly represented 
by the interpolated series.” On the other hand, if we select a series whose 
fourth differences change frequently in sign, (D) because of its non-reproducing 
characteristic has valuable smoothing possibilities. In like manner, (C) may 
be valuable when the second differences change frequently in sign. Mr. Jenkins 
gives at quinquennial ages rates of mortality which were graphically determined 
from the published American Men Ultimate Experience. Since the fourth 
differences of these rates change frequently in sign, we will apply (D) to a few 
of these rates. So far as I know no suitable actuarial examples have been 
found to which (C) can be applied. However, there is the possibility that (C) 
might be valuable in some sciences. Since I do not know of any suitable real 
example to which (C) can be applied, we will apply it to a trivial series whose 
second differences change frequently in sign. 

We are now ready to apply in a convenient manner (C) and (D) to the 
examples selected in the preceding paragraph. 

First, we will apply (C). I have in my other article applied (B) in a con¬ 
venient manner. This method with little change can be applied to (C): If 
it is desired to apply (C) at either end of the table where values of u x are not 
available for the calculation of the second differences, it can be assumed they 
vanish. It is convenient if S and S 2 represent respectively the major differ¬ 
ences Ar* and A 2 u x in such a manner that they are arranged centrally in the 
working illustration. It is convenient if s and s 2 represent respectively the 
minor differences 8y x and d 2 y x ■ The quantity y Q can be computed by yo = 
u>a + i&o, and y\ can be computed in like manner. Since we wish in the working 
illustration of (C) to interpolate four values between y 0 and yi , the middle 
s — 8y .4 = .2Ay 0 , and s a = .04 B = .02(6 0 4 - hi). We can by the use of the 
foregoing method apply (C) to suitable functions, whose given values can be 
represented by /(r). Then, it follows from the definition of u x that f(r) — u x . 
It might prevent confusion if it is stated that x and r are related to each other 
in such a way that we always interpolate between y Q and yi , We shall now 
apply (C) to the case when/(r) represents the trivial series shown at top of 
page 3. 

Finally, we will apply (D). Mr. Henderson has applied (A) in a very con¬ 
venient manner. His method with little change can be applied to (D). If it 
is desired to apply (D) at either end of the table where values of u x are not 
available for the calculation of the differences required, it can be assumed 
that the fourth differences that can not be competed vanish, and. the required 
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differences can be filled in consistently with that assumption. It is convenient 
if S, S 2 , and S 3 represent respectively the major differences A u x , A 2 ^, and 
A 3 ^ in such a manner that they are arranged centrally in the working illustra¬ 
tion. It is convenient if s, s 2 } and s 3 represent the minor differences so that 
by definition s = s x = 8y x , s 2 — s 2 x = 5 2 y x -. 2 , and s 3 = 5 z y x . The first 
s 2 = <5 2 t/_. 2 = .04(6 0 — Ido). The lasts 2 = 5 z y. g = .04(bi. — fdi). The quan¬ 
tity 2/0 can be computed by y 0 = uq — ^do , and yi can be computed in like 
manner. The middle s = 5 y A = .2A y 0 — s 3 . We are now in position to 
apply (D) to the quinquennial rates of mortality. 


Age 

Rate 

£ 


£ 3 


72 

.07010 

.03808 




77 

.10818 

.04669 

.00861 

.01799 


82 

.15487 

.07329 

.02660 

- .01946 

-.03745 

87 

.22816 

.08043 

.00714 

.12572 

.14518 

92 

.30859 

.21329 

. 13286 

.12572 

.00000 

97 

.52188 


.25858 


.00000 
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Age 

1/x 

5 

S 2 

S 3 

82 

.15591 

12612 

.001314 


83 

.168522 

13527 

915 


84 

.182049 

.014043 

516 

- .000399 

85 

.196092 

14160 

117 


86 

.210252 

13878 

-.000282 


87 

.22413 

13460 

-.000682 


88 

.237590 

13977 

.000517 


89 

.251567 

.015693 

1716 

.001199 

90 | 

.267260 

18608 

2915 


91 

.285868 

22722 

4114 


92 

.30859 

28006 

.005314 


93 

.336596 

34326 

6320 


94 

.370922 

.041652 

7326 

.001006 

95 

.412574 

49984 

8332 


96 

.462558 

59322 

9338 


97 

.52188 


.010343 




SOME SIMPLE DEVELOPMENTS IN THE USE OF THE 
COEFFICIENT OF STABILITY 


By C. H. Forsyth 

Some time ago the writer proposed 1 a coefficient of stability C„ to be used 
to measure the stability of a statistical series, where that coefficient is defined 
by the relation 


where M denotes the arithmetic mean and <r 2 the square of the dispersion of 
the terms of the series. It was proposed to regard series as unstable (Lexian) 
for which the value of the coefficient exceeded unity, and stable otherwise. 
The only essential way in which such a procedure differs in results from the 
traditional method is that it includes as stable those series for which the value 
of the coefficient lies between unity and q the probability of failure of the event 
under investigation—series which would be classed as unstable according to 
the traditional method. Stable series—according to either standard—are found 
so rarely in practice and therefore so many series are accepted as fairly stable 
which come anywhere near meeting the requirements that replacing q by unity 
as the line of demarcation affects the classification of no known series but 
adds to the effectiveness of the avowed purpose and use of the proposed coeffi¬ 
cient—to avoid the round-about work of computing values of probabilities. 
Another merit of the use of the coefficient is that it enables one to measure 
and therefore compare the stability of several, series—a feature which we shall 
illustrate later. 

In brief, such a coefficient provides a means of introducing the whole'Lexian 
theory into Federal publications such as those on vital statistics, since a com¬ 
parison of the values of the coefficient for, say different communities or countries, 
would be readily grasped by any reader, whereas the traditional method would 
prove too subtle and laborious, and allow no ready comparison of results. 

For purpose of orientation let us illustrate the situation by analyzing a simple 
series both ways—the traditional way and by the use of the coefficient of sta¬ 
bility. As an example, let us consider the death rates of white infants under 
one year of age for 1919 (considered on page 89 of the Handbook) for those 
states whose frequencies of births are comparable or which vary little from 


1 Journal of the American Statistical Association, June, 1932. 
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their average of 47,830—where the number, of deaths for each state has been 
adjusted to this average as a base, 


Adjusted 
Deaths X 


Cal..... 

3350 

Conn. 

4700 

Ind. 

3732. 

Kan. 

3253 

Ky. 

3686 

Minn. 

3159 

N. Car. 

3541 

Va. 

3732 

Wis. 

3780 


9)32933 


M = 3659 


X - 3659 

(X - 3659)* 

-309 

95481 

1041 

1083681 

73 

5329 

-406 

164836 

27 

729 

-500 

250000 

-118 

13924 

73 

5329 

121 

14641 

1335-1333 

) 1633950 


181550 = <r 2 
<r = 426 


The traditional method would be: 

The mean M = np = 3659 where n = 47,830. 

„ 3659 , 44171 

lienee p > pad « - ^ 

and n s H — npq = 3659 f ^ggg ) = 6378 
whence <t b = 58.15 


which is the value of the dispersion we should expect if the basic probability 
were constant throughout. But the value of the dispersion proves to be 
<r — a/181550 bb 426, and the comparison of the values shows that the basic 
probability to be very variable £nd therefore the series to be very unstable or 
Lexian. * 

The computation of the value of the coefficient of stability is much more 
simple and direct 


^ = 181550 

~ M “ 3659 


49,6 


whose excess over unity also clearly indicates the instability of the series. 

Since proposing the coefficient of stability the writer has been impressed by 
the overwhelming proportion of existing series (such as birth rates, various kinds 
of death rates, etc.) which employ arbitrary bases (such as “per thousand/' 
"per ten thousand,” etc.) usually without mention of the actual base. It is 
obvious, of course, that such rates, or occurrences per arbitrary base, say 5, 
can first be adjusted to give occurrences per actual base, say B (assuming that 
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base B* can be determined) but the work can evidently be performed much 
easier. For, since the original scries (per arbitrary base b) Xi , Xi , * ■ • X* 

would become, on adjustment, ^Xi, ^X 2 , ■ • - the mean would become 

bob 

j-M and the square of the dispersion (— c ), whence the formula for the coeffi¬ 
cient of stability would become 


C s = 


M' b 


( 2 ) 


As an example, let UvS consider the general death rates, per 10,000, of New 
Zealand for the years 1921-30. 



X 

X — 86 

O 

O0 

I 

1921 

87 

1 

1 

1922 

88 

2 

4 

1923 

90 

4 

16 

1924 

83' 

— 3 

9 

1925 

83 

-3 

9 

1926 

87 

1 

1 

1927 

85 

' -1 

1 

1928 

85 

-1 

1 

1929 

88 

2 

4 

1930 

86 

0 

0 


10)862 

M = 86.2 

10-8 

)46 

4.6 


This example illustrates the danger of using the coefficient of stability unless 
the series consists of actual occurrences or unless the actual base is given due 
consideration. Without due consideration of the actual base (here the popula¬ 
tion of New Zealand) one might easily fall into the error of regarding the value 
of the coefficient of stability as 4.6/86.2 and, therefore, the series as very 
stable. But the population of New Zealand is about a million and a half and, 
therefore the true value of the coefficient of stability is 

A6 1,500,000 

* 86.2 10,000 


* Strictly speaking, this actual base B should be constant throughout the series; other¬ 
wise the successive numbers of occurrences—-the terms of the series—would not be com¬ 
parable. Where, however, the base B varies little from term to term—as usually happens 
even in the best of series, such as a series of some kind of rates of the same community 
over a short interval—the variation can be ignored, in which ca§o base B (to which the 
terms of the series are adjusted) usually means the arithmetic mean of the different bases. 
In the first treated above, the investigation was limited to certain states in an effort to 
comply with the rule just mentioned but the example is a poor one since the variations 
are still dangerously too large. The situation is saved by the conclusive results. 
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which shows the series to be unstable. However, before we condemn New 
Zealand’s death rates too severely, let us compare her record with those of 
other important countries, including our own, for the same period. 


General Death Rates (per 10,000) 

M 


New Zealand. 86.2 

Australia. 94.3 

Sweden. 120.4 

Scotland. 137.3 

Austria. 151.1 

United States. 118.0 

England-Wales. 121.3 

France... .... 170.3 

Spain. 193.7 

Italy. 163.5 

Germany. 125.4 

Japan... 206.4 


G, 

8 

90 

96 

139 

536 

830 

1117 

1129 

2190 

2760 

6040 

6800 


These results show how extremely unstable most series of general death 
rates are and that the series for New Zealand, while unstable according to 
our strict criterion, enjoys quite an enviable position practically in a class by 
itself. Parenthetically, these results also illustrate fairly well the triviality, 
with respect to results, of replacing q by unity as the critical value of the coeffi¬ 
cient of stability, discussed at the beginning of this article. 

The values of the coefficient listed above would, of course, be reduced some¬ 
what in most cases if the trend of the series were first eliminated but the writer 
has gone though all this'work and found it not worth while—that is, the series 
would still remain markedly unstable. , . 


Another development proves useful when, as frequently happens, the actual 
base B is unknown to a degree of accuracy desirable for use in formula (2). 

Prom the inequality g 1 
M o 


we obtain 



( 3 ) 


which is to he used to show how small an actual base should be for the given 
series to be stable. As an example, let us consider the maternal mortality, 
per 10,000 live births, in the so-called expanding registration area of the United 
States. 
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Maternal Deaths in the United States (per 10,000 live births) (Expanding 

Registration Area) 


X X - 66 (X - 66) 2 


1923 

67 

1 

1924 

66 

0 

1925 

65 

-1 

1926 

66 

0 

1927 

65 

-1 

1928 

69 

3 

1929 

70 

4 

1930 

67 

1 

1931 

66 

0 

1932 

64 

-2 


10)665 

66.5 

9-4 


1 

0 

1 

0 

' 1 
9 

16 

1 

0 

_4 

)33 

3.3 


Hence, by formula (3), B g (10,000) or about 200,000. The number 

< of live births varies so greatly that we should probably find it impossible to 
agree upon a satisfactory number 2 to use as an actual base for such an li ex¬ 
panding area” but we should all agree that it would be so much greater than 
200,000 that the instability of the series would be unquestioned. 

One must be careful in comparing the results of two or more investigations 
like the One just conducted. For example, the analogous result for Canada, 
for the same period yields B g 113,000 and we might conclude, too hastily, 
that the United States series is more stable (or less unstable) whereas any 
knowledge whatever of the numbers of live births of the two countries would 
show that Canada comes much closer to fulfilling her requirement than the 
United States and that the palm must go to Canada. For one thing, Canada 
has about the population of New York city and New York city has about 
100,000 live births annually. In any case, close decisions in matters of this 
kind would be difficult without sufficient information in regard to actual bases. 

There is still another situation whieh is interesting but of much less impor¬ 
tance because of the rarity of its occurrence. . It will be recalled that the coeffi¬ 
cient of stability was devised mainly to avoid the use and computation of 
probabilities and that the only difference between the results by the traditional 
method and by the use of the coefficient of stability lies in the trivial replace¬ 
ment of the critical value q by unity. In the traditional method of analysis, 
but by comparing the value of the coefficient of stability with the coefficient 
is evidently always, strictly speaking, a function of the actual base B . In 
other words, there is no statistical series, however stable it may seem—except 


2 It was in the neighborhood of two million in 1932. 
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for the trivial case when all the terms of the series are exactly the same but 
what would be unstable if the base were small enough. It is possible to formu¬ 
late the' limit once for all below which the given (otherwise seemingly stable) 
series would prove unstable. 

If, in the relation a S npq (for stability) we replace p by M/n, q by 1 — M/n 
and then n by B, we obtain 


. ., M 2 M 2 

' * M “ T or T 


<M- 


whence, finally 



(4) 


where the transference of the term M — <r 2 from one side to the other should 
cause no apprehension since, by hypothesis, <r 2 < M and M — <r 2 is therefore 
always positive. We propose to employ formula (4) in those rare cases where 
the value of the coefficient of stability of actual occurrences—but without 
reference to an actual base—is less than unity—that is, where the given series 
proves to be stable according to the method proposed by the writer—and 
determine the upper limit of the values of the base B for which the series would 
be unstable according to the traditional method of analysis. As an illustra¬ 
tion, let us consider the familiar series of annual football fatalities in this country 
for the period 1906-1930* (omitting the years when no records were kept). 


Football Fatalities 


1906 

11 

1917 

12 

1907 

11 

1921 

12 

1908 

13 

1923 

18, 

1909 

12 

1925 

20 

1911 

11 

1926 

9 

1912 

13 

1927 

17 

1913 

5 

1928 

' 18 

1914 

13 

1929 

12 

1915 

15 

1930 

13 


It is easily verified that C s = which is clearly less than unity; whence 

the series clearly seems stable. Applying formula (4) 


B g 


13.055 2 

13.055 - 11.942 


or 153 


which shows that the given series is stable as long as the total number of foot¬ 
ball players exceeds the number 153. A recent news item quoted an estimate 
of the number players participating in games of four hundred colleges as about 



USE OF COEFFICIENT OF STABILITY 


11 


13,000 and over 600,000 including high schools and all. We can then definitely 
say that the series just considered is stable. Such a conclusion has no bearing, 
of course, upon what might happen if other terms were added to the series. 
It happens that adding the records for the next five years—1931(33), 1932(32), 
1933(27), *1934(25), 1935(30)—would change the whole series to an unstable 
one with C, = 56.9/16,6 = 3.4; but, obviously, the additional records belong 
to a new regime of collection. 



INTERNAL AND EXTERNAL MEANS ARISING FROM THE SCALING 
OF FREQUENCY FUNCTIONS 

By Edward L. Dodd 

The scaling 1 of frequency functions has been discussed from the standpoint 
cf maximum likelihood. But the likelihood criterion to be satisfied sometimes 
leads to a minimum likelihood; and sometimes to neither a maximum nor a 
minimum. Scaling will be studied in this paper with reference to the likelihood 
actually secured, and also with reference to the character of means obtained, 
whether internal or external. 


SECTION 1. INTRODUCTION 


It is well known that a scale obtained in a curve-fitting process is sometimes 
a mean. Thus, with the normal function 


( 1 ) 


1 c '(*/a )!/2 

a V^ir 


if the scale a is to he obtained from measurements, *i, * 2 , • • • , x„, we com¬ 
monly accept the value 

(2) a '{;E*i} m ; 

that is, the root-mean square of the measurements. Here, the positive value 
of a is naturally taken. It is called the standard deviation, and thought of as 
an appropriate new unit of measure. 

But even with the x’s all negative, and the a taken positive, 0. Chisini 2 con¬ 
sidered it proper to regard a as a mean of the re’s, albeit an external mean. 
From Chisini's viewpoint, this a whether regarded as positive or negative is 
primarily a solution of 


(3) x\ + xl + •. ■ -f x\ = o 2 -f a 2 -|- • •. 4- a i . 

In this sum of squares, the single number a may be substituted for each of the 
x s. Perhaps this kind of mean should be called a substitutive mean to dis¬ 
tinguish it from the means of general analysis which are always internal. 


1 Fisher, R. A,, “On the mathematical foundation of theoretical statistics,” Philo¬ 
sophical Transactions of the Royal Society of London, Series A, Vol. 222, 309-568, (19211. 
See p. 338. ' ’ 


8 Chisini 
(1929). 


, 0., Sul concetto di media,’’ Periodico di matematico, Series4, Vol. 9,106—116 
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The normal function is a particular case of a more general function: 

(4) Constant*a -1 e* co , 4 >(t) = -t p /p, t = x/a. 

The likelihood method to find the scale a for this function leads to power means, 
including the arithmetic mean, the root-mean-square, root-mean-cube, etc., for 
p = 1, 2, 3, etc. 

The word scale will be used only for a positive number,—which then may be 
regarded as a unit of measurement. 

Tor measurements, Xi } x 2} • ■ ■ , x n Chisini regarded M as a mean, relative 
to a function G, provided 

(5) G(x lf s 2 , ,x n ) = G(M, M } ...,1), 

If a solution of this equation is 

(6) M = F(xi, x 2 , , x n ) f , 

and c is a possible value for the x’s, it follows at once that 

(7) F(c, c, - ■ • , c) = c, 

or at least one value of this F is c. Conversely, if (7) is satisfied, it is but a 
change of notation to replace c in (7) by M, and to combine this with (6) to 
obtain 

(8) F{x u • * * , x n ) = F{M ) M, • • • , M). 

Hence, this F which in (6) gives explicit form to the implicit M found in (5) 
may also be thought of as a mean-forming function, such as G in (5). Briefly, 
F is a particular G. Thus F(x i, x s , • •« x n ) is a mean of Xi, x 2 , • • • , x n} if F 
is so constructed that (7) is satisfied when the arguments are all equal. 

Inasmuch as a frequency function/© is non-negative, log e /© is real,—say 
0© plus constant. Following R. A. Fisher, it will be convenient to write 

(9) /© = CaT 1 e 0(O , ' C — Constant 

With location m already determined, the x’s will be, thought of as measured, 
from m. And we set 

(10) t = x/a , U = Xi/a , i - 1 , 2, • • • , n. 

The “productive 7 ’ probability—to yield X\ } ••• , x n —is then 

(11) L = n/(<i) = C n aT n 

This is proportional 3 to the “likelihood” of a. Also—it may be noted in 
passing—the productive probability is also proportional to the a posteriori 
probability, if a constant a priori probability is postulated. The likelihood 
will here be taken as Uf(U) itself; and it will be designated by L ,—in Fisher’s 


8 Loc. Cit,, Fisher, p. 310. 
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notation, L = log H. Of course, IT and log IT take maximum values simul¬ 
taneously, if at all. From (11) it follows 4 that 

(12) ' -a -9 log L/da - n + 21,4'(U) = + 1|- 

The equation 

(13) 2t4'{ti) + n = 0 ■ (i = 1, 2, • • • , n) 

will be called the likelihood condition, whether this leads to maximum likeli¬ 
hood, to minimum likelihood, or to neither. A second differention 5 leads to 

(14) a • d 2 log L/ba = 2&>"(i<) - n = - 1}. 

When negative, this indicates a maximum likelihood; when positive, a minimum 
likelihood for the a obtained from (13). 

Preparatory to the theorems of the next section, just one more matter will 
be discussed. The unit for t is arbitrary; and it may be convenient to write, 
with k 7 * 0, 

(15) 4>(i) = <t>(ku) = 4>(«), t = ku. 

Then 

(16) = mV(u). 

Suppose, now, that a positive constant fc can be found such that fa//(ft) = — 1. 
Then, with t = ku, as postulated, 

(17) 1-$'(1) = Jty'(ft) - -1. 

Thus $'(1) = — 1,—or as it will now be written $'( 1) = — 1,—is no more 
restrictive than the condition that some positive k exists such that fa//(ft) = — 1. 

SECTION 2. GENERAL THEOREMS- CONCERNING THE SCALE AS A MEAN 

Theorem I 

Given the frequency function 

/(0 = Ca 1 e* <() , t = x/a, 1, = Xi/a, C = Constant. 
And suppose that 

( 19 ) 4>'(1) = -1. 

Suppose, also, that for given x u x 2 , • ■ • , x„, the likelihood condition (13), 
now written 

(Xi/a)<t>'(xi/a) + n = 0, ■ 


* Loc. Cit,, Fisher, p. 338, 
‘Loo. Cit., Fisher, p. 339. 
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has a positive solution. 

(21) a = F(x i, x t , ••• , x„). 

Then this a, the scale, is a mean. 

Proof. With each x t > = 0, (20) cannot bo satisfied. 

But if, with c 7* 0, we take each x t - = c, and at the same time set a = c, 
then, by (19), 2 — — n; and thus (20) which gives a implicitly is satisfied. 
The explicit a in (21) is therefore such a function F that (7) is satisfied. Hence, 
the scale a is a mean. 

Theorem II 

Given the frequency function 

(18) f(t) — CaT 1 e* U) , t = x/a , t { — x i/a, C = Constant. 

Suppose that 

(19) *'(1) - - 1, 
and that 

(22) ■ | Ut>\t) | < 1 if |*| < ]. 

Moreover, suppose that the likelihood condition (20) for measurements 
xi, x 2 , • • • j Xn , has a positive solution a. Then 

(23) • a g Maximum | X{ |. 

Or, suppose that, in place of (22), we have 

(24) |ty'(i)|>l if |*| > 1; 
and that Uj>'(t) keeps the same sign, if | * | > 1. Then 

(25) Minimum | x» | g a. 

Proof . Suppose, if possible, that a > Max |x<|. Then each | Xi/a \ < 1, 
and by (22), | (x;/a)<?5/ ( Xi / a ) \ < 1. Then (20) is not satisfied, since | 2 ] < n. 
Thus the hypothesis is contradicted. 

Now (25) is satisfied at once if any x t * = 0. But suppose, on the other hand, 
that Min | x t - 1 > 0; and, if possible, that a < Min \% i \. Then, by (24) et 
seq,, since | Xi/a | > 1, it follows that \X \ > n. And thus (20) is again con¬ 
tradicted. 

Theorem III 

Given the frequency function 

(18) /(*) = Ca' 1 e* {i \ * = x/a, . *,• = x,-/a, C = Constant; 

and set ^(*) = *</>'(*) + 1. Suppose that 

(26) lim Hi) = a, lim t(t) = p, afi < 0. 

i ->o | t\ -> 
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And suppose that ^(0 is continuous when t ^ 0. 

Then, for any sot of real numbers, .tj, ■ , x n , of which none is zero, 
there exists a positive number a, as scale, such that the likelihood condition 

(20) £i(*r/a)$W fl ) -f n = 0 

is satisfied. 

The conclusion is also valid, if in place of the limit /3, there is postulated 
(27) lim ^(/) - - a | oo | = lim i pfy), 

Ij-ItO i —* e—0 

where b > 0, c > 0, and is continuous for — t < t < 0 and for 0 < / < c. 
That is, the new limits arc to be infinite with sign opposite to that of a. 

Proof. The limits for t —> 0 and for J t j —» «> are the same as the limits 
for a —► oo and a —> 0+,—noting that i = x/a } x ^ 0. Thus 2^{i,) changes 
sign as a goes from 0+ to Hence, .since \p(t) is continuous, (20) is satisfied 
for some positive a. 

For the proof of the second part of the theorem, suppose that x n > 0 and 
that £„ is the greatest X{ . Then with a > xjc , but approaching xjc , ${xJo) 
becomes infinite with sign opposite to that of a. Furthermore, in 2i p(xja), 
the positive #'a < x n have a negligible effect; and thus lim 2^(a:,/a) t as 
a -+ (xjc) + 0, is infinite with sign opposite to that of a, when this sum 2 
is taken for the positive ic's, Likewise, if Xi < 0, and is the least ap , lim 2)p(x{/a) } 
as a —> (—Xi/b) 0, is infinite with sign opposite to that of a, when this sum 

is taken for the negative x’a, If, now, the measurements happen to be all 
positive, we think of a as approaching xjc + 0j and the continuity condition 
leads to an a which makes 2^(x,/a) = 0. Likewise, if the measurements 
happen to be all negative, we use —x\jb + 0. If both positive and negative 
x J s appear, we use the greater of the two ratios -xjh and xjc, 

section 3. some fairly reoular frequency functions 

To illustrate the foregoing theorems in a somewhat general manner, consider 
the measurements, Xi t ■ , x n , and with t = x/a, U = re,-/a, set up the 
function: 


(28) f{t) - Ca" 1 1 U | p (1 + k s t 2 r e~ r < li »•, 

whore, as before, C is a suitably chosen constant. 

Suppose also that 


(29) 

V > -1, 3^0, 

r ^ 0, $ ^ 0; 

and that either 



(30) 

r > 0, a > 0 or 

r s? 0, 2q > p 4- 1. 

j 

Then with <fc(t) 

- l°g/(i)j it follows that, when t 0, 


(31) (*'(() + 1 = (j> + 1) -«**[((• - 2#V(1 + kY)-\ 


I 


t 
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Now the condition 1-0'(1) — —1 would be satisfied if ^r(k) = 0, where 

(32) tf(fc) = rsk i+2 + rsk 9 + (2 q ~ p - l)fc 2 - (p + 1). 

But, under the conditions (29) and (30) ^(0) < 0, and ^(®) > 0, Hence, 
there is a positive k for which tyQc) = 0. Then if fc be assigned this value, 

(19) is satisfied; and by Theorem I, any scale a that the likelihood condition 

(20) may lead to is a mean, But, by Theorem III a scale a will actually exist 

—indeed, for any positive fc that may be used in (29); since the limit of + 1 

is positive as l 0, and is negative as J t \ —> ». 

Moreover, if in (29), the further conditipn —1 < p Si 0 is introduced, (22) is 
satisfied. And, thus, a S Maximum | 1. Also, | t4'(t) | increases with f t J. 

Hence, by (24) et seq., Minimum | a* | ^ a. 

If in (28), we set q 0, s =* 1, r > 0, and confine our attention to positive 
x and t, there is obtained the Pearson Type III. Reference to (32) shows that 
'Jf(fc) ss 0 if fc — (p -p l)/r. With this substitution, 

(33) /(f) = C l <T l f p r Cp+1) *, V - Constant. 

Since 0'(1) = -1, any solution of the likelihood condition is a mean. Here, 
with / > 0, f0'(f) = p — (p + l)f, and f 2 0"(O — 1 = — (p + !)■ Prom (14) 
we see that, with p 4-1 > 0, any mean obtained corresponds to maximum likeli¬ 
hood and the single maximum found is actually the largest value. Moreover, 
with the measurements, Xi, Za, • • • , as nj all positive, a scale a will exist,—as 
noted in the general case (28). 

In passing, it may be noted that Type III appears 6 rather naturally in a 
form giving 0'(1) » — 1 at once, without any transformation. Here, then, a 
scale is a mean. 

Given the Pearson Type I in the form 

(34) f(t) - CaT l (b + htY(c — ftf) a , t = x/a ) b > 0, c > 0, | pq \ > 0. 

If V + Q + 1 > 0, it is possible to find a positive h so that with 0 = log/, 
0'(I) — — 1. In this case, any scale found by the likelihood condition is a 
mean. With k thus chosen, /(f) has essentially the same farm as it would have 
if fc — 1. Hence for convenience, let us simply set k = 1 in the above equation. 
Then for —b < t < c, 

0(f) = f0'(f) + 1 = 1 + pt{b + f)“ l - fff(c - ty\ . 

Suppose now that p > 0 and q > 0. Then Theorem III may be applied; since 
lim 0(f) ~ 1, as f —► 0; but lim 0(f) —> — <», as i ~► ~b -p 0, or as f —► c ~ 0. 


1 Carver, H. C., Handbook of Mathematical Statistics, Chap. VII, see p. 105, Line 4, 
noting that <t>' = y’/y. 



18 


UPWARD L. DODD 


Hence a scale a satisfying the likelihood condition exists, Moreover, the likeli¬ 
hood is at a maximum; since, with < t < c } 

tWil) - 1 = ~pt\b + 0“* - q?(c - 0“ a " 1 < 0- 

This maximum is also the largest value for all values of a. 

If the Pearson Type IV is given in the form 

(35) f(t) = Ca _1 (l + k 2 ?r e Q Qro tan t - */o 

then if p > 1/2, it is possible to find a positive k which will make = -1. 
In this case, any scale a is a mean. Moreover—for any k 7* 0—the limit of 
W{1) + 1 is 1 for t -4 0 and is I - 2p for t -+ «. Hence, by Theorem III, 
if p > 1/2, as above, then a scale a exists satisfying the likelihood condition (20) * 


SECTION 4. FREQUENCY FUNCTIONS WITH CERTAIN PECULIARITIES 

The theorems of section 2 give sufficient conditions, which in some cases 
may not be necessary. Nevertheless, by violating certain hypotheses, particu¬ 
lar functions may be set up which exhibit various peculiarities. 

For the Pearson Types, the differential equation is 



y'it) __ .//a Co + a d 
y(t) ~ * w " 6, + b[ + &/ 


t = x/a. 


The determination of a positive scale a by the Fisher likelihood process is 
impossible here, in case a 0 = 0, ai > 0, bo + M + fat 1 > 0. For in this case 
14 f (l) ^ 0; and thus (20) cannot be satisfied, The U-shaped Type II curves 
arc in this class. Likewise, if q 0 ^ 0, £q = 0, and ho + hji + b g t a > 0,—for 
example, with h > 0, b\ < 4h 0 h 2 and the measurements all happen to have 
the same sign aa a 0 , such scaling is impossible. 

For the purpose of constructing peculiar functions we may take c > 0 and 
require that the measurements s f be either — c or c^with at least one — c and at 
least one c—and that £(i) be an even function. Then 0(—c) - <fi(c) and (11) 
becomes 

(37) L « \CcT' e* {eh Y. 

The likelihood condition (13) reduces to 


(38) 0 = 0(t) * (0 + 1 = (c/aW(c/a) + 1, 

with the right member an even function of c/a. And from (14), a maximum 
likelihood is indicated when 


( 39 ) (c/a) 1 V"(c/a) - l.< 0, 

with the left member likewise an .even function. A minimum likelihood is 
indicated if the left member ia positive, 
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Let us apply this to the case where 

(40) HO - (-2/3) log (1 - 3 1 1 1); t*'(Q = 2111 (1 - 3 | i|)“\ 

The likelihood condition (38) is satisfied onlj r when i — d=l. Also <^(1) = —1. 
Thus the only means are the internal means ±c; and the only scale conformable 
to (38) is a = c. But this has minimum likelihood; since 1 — 1 = ^ > 0. 

For positive t, this function (40) is a Pearson Type. 

Consider next a function of the form (28),—with p = -1,25, g — -0,5, 
however,—for which (31) becomes 

( 41 ) < 4 .'(<) + 1 = - 1/4 - <74 + < 7(1 + <’) = -(1 - < V / 4(1 + < ! ). 

whence <£'(1) = — 1, = +1, 4>'"(1) — —3. Here the likelihood condi¬ 

tion (38) has but a single absolute solution 1 1 | = 1, leading to the single scale 
a = c, and to the two internal means, ±c. But, in this case 1 •</>' / (l) — 1 = 0, 
so that 3 2 log L/Ba 2 *= 0. Moreover, for t — 1, B 3 log L/da 3 = a~ 3 ^ 0, Thus, 
the only scale obtained by the likelihood method (38)—viz., a = c— has a 
likelihood which is neither at a maximum nor at a minimum. 

Another anomalous function is that given by 

(42) HO * t ~ 2.5f 2 , t = ±c/a. 

The likelihood condition (38) leads to 

HO = (1 - * 2 )(1 - 4 * 2 ) = 0 . 

j 

The only solutions are t = ±1, giving internal means =fcc; and i = d= 1/2, giving 
external means ±2c. And from (39) et seq,, it can be shown that the internal 
mean and scale, a = c has minimum likelihood, while the external mean and 
scale, a = 2c, has maximum likelihood. 

But it will be noted that a maximum value for a vicinity does not always 
signify a largest value for the entire possible range. Indeed, for the function 

(42) , a = 2 c has maximum likelihood without having the largest likelihood. 
To avoid such an anomaly, a necessary condition is that as \t\ —> °°, 
HO —► — « ; as seen by taking the logarithm of L in (37), noting that as a -> 0, 
(-log a) -► +». 

Finally avoiding the anomaly just mentioned, let us set up a frequency 
function, using the HO in (38), and writing 

HO = 1 + ty'(0 ~ (I — 2i 2 )(l - £ a )(l - 0.9f 2 ). 

From this it follows readily that s 

(43) , HO = K - 1.95! 2 + 1.175t 4 - 0.3 1\ K = Constant. 

This, with U = ±c/a, leads to an internal mean or scale a = c with minimum 
likelihood, a nearby scale a - c with maximum likelihood—differing 

indeed only slightly from, the minimum just mentioned—and another scale 
a = cy/2 having maximum likelihood, and this likelihood is indeed greater 


i 
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than that for any other positive value of a. The external moan a - c\/2 
in this case has the largest likelihood. This may be checked by the use of 
the logarithm of L as it appears in (37), in which the important part is 
<fi(c/a) - log a, 

In passing it may be noted that if f (l) has the form = (1 - i)Il{i) } with 
H(l) f «, and k - Xijo ,; then any solution a of the likelihood condition 
^(i) = 0 is a meaiij—by Theorem I. 

SECTION 5. SUMMARY 

When the E, A, Fisher likelihood method is used to find an “optimum” scale 
for frequency functions, it sometimes happens that this scale is a well known 
mean or at least is a MWtofa mcan-Sec Equation (5). Or a simple trans¬ 
formation (15) may often put the frequency function into such a form. Con¬ 
ditions are given under which a scale will be a mean. Under further condi¬ 
tions this mean will be internal—at least as regards absolute values. Finally, 
under certain conditions, a scale will exist, 

But for certain functions not satisfying these conditions, anomalies appear. 
The scale given by the usual likelihood condition may be a scale with a minimum 
likelihood. Sometimes the likelihood will be at neither a maximum nor a 
minimum. In certain simple cases, no scale exists. Furthermore, it may 
happen that the scales which are internal means have minimum likelihood and 
those that are external means have maximum likelihood. Among Pearson 
Types are found both anomalous functions and functions which would bo 
regarded as regular as regards maximum likelihood, 

In this problem of scaling, likelihood is proportional to a foskrion probability 
with the a piori probability taken as constant, 



MOMENTS OF ANY RATIONAL INTEGRAL ISOBARIC SAMPLE 

MOMENT FUNCTION 

By Paul S. Dwyeh 
Introduction 

j 

The problem of moments of moments has been investigated by a number of 
authors. The assumption of an infinite universe (or that of a finite universe 
with replacements) permits the application of the “algebraic” method, the 
method of semi-invariants as introduced by Thiele (1) and developed by C. C. 
Craig (2) and the combinatorial analysis method introduced by R. A. Fisher (3) 
and used by N. St. Georgescu (4), A combinatorial analysis method has the 
particular advantage that it enables one to compute separate terms of a given 
formula. 

The formulae for moments of moments have been simplified through the 
use of new moment functions. Thiele introduced the half-invariant (1) which 
resulted in considerable condensation. More recently Prof. R. A. Fisher (3) 
has introduced the sample function k whose expected value is a half invariant. 
The most compact formulization presented thus far is his formulation of the 
half invariants of the sample k r in terms of the half invariants of the universe. 
This very compactness, however, makes it difficult to compare results with 
those expressed in the more conventional sample functions. Dr, Wishart has 
written a paper (7) in which he shows, among other things, how the Fisher results 
can be translated to the more conventional (Craig) results and vice versa, but 
such translation is in general no simple matter. It appears that the Fisher 
results are not immediately useful to the statistician who desires the formulae 
to be expressed in terms of the usual sample moment function. On the other 
hand the Fisher formulization is a remarkable discovery toward that harmony 
which must be naturally inherent in the field of moments of moments. Soper 
(6, 111) expressed the general situation when he wrote, “If the terrifying over¬ 
growth of algebraic formulation accompanying this branch of statistical inquiry 
is destined to have a chief utility in induction and going back to causes, then 
perhaps Dr. Fisher's way of estimating a sample will prove to be most fertile, 
but if it is to be applied to problems of deduction, say to problems of suc¬ 
cessive eventuation such as propagation, then Mr. Craig's plain moments seem 
to have a firmer hold on the exigencies of time.” 

It would appear then that the Fisher formulae and the Craig formulae are 
both needed. Georgescu (4) showed a partial connection between them in 
applying to the m functions a combinatory analysis somewhat similar to that 
applied by R. A. Fisher to the k function. It is the purpose of the present 

21 
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paper to work out a combinatorial procedure for a more general sample function 
so that either the Fisher or Georgescu combinatorial results come out as special 
cases, In making such a generalization no limitation is placed on the sample 
function except that it bo rational integral and that all terms are of the same 
weight. Thus the results are applicable to m n m T + k r) m r lt r , etc, as well 

as to tn r and K although they arc not applicable to V m r or in this way 

the important formulae for the moments of a new sample moment function 
will be available by simple substitution as soon as any such new function is 
defined by a rational integral isobaric expansion of power sums. 

It is thus the purpose of this paper to determine the moments of a general 
moment function of the sample. This is done by keeping the multipliers of 
the various partitions of power sums indefinite until all manipulation, is complete. 
It is then possible to assign the definite values of these multipliers which are 
associated with the desired sample function and to obtain the moment of 
the desired moment function in this way. Thus the Fisher result k( 42), and 
the Craig result # 11 ( 1 / 4 , w) are special cases of the new result \ u (/ 4 , f%)- It 
is obvious that it is not possible to carry the results using these general moment 
functions as far as Fisher and Wisharfc (3), (5), (7), have carried the results of 
the decidedly advantageous (from the standpoint of simplicity of Tesult) k func¬ 
tion and yet it ia surprising to find the simplicity which can be obtained in 
the general case. Incidentally the introduction of the more general symbols 
clarifies the successive steps of the partition analysis which are somewhat con¬ 
fusing in any specific ease because of the insertion of the value of the coeffi¬ 
cients of the power sums in which the sample moment function is expressed. 

This paper is divided into three parts. The first part includes the necessary 
definitions, the basic formulae, and the general development of the algebraic 
method. In order to facilitate the algebraic work there is inserted a table giving 
the expected values of all possible partition products of power sums whose 
weight 5>8. The second part deals with the different sample functions which 
might be used. The third part gives a list of the various partition formulae, 
of weight ^8, which contain no unit parts and shows how these can be used in 
writing the chief variations of the formulae for moments of moments. 

Part I 

h 

1. General Moment Functions. Different moment functions have been de¬ 
fined in various ways, but all moment functions have in common the property 
that they may be expressed in terms of the power sums, It appears sensible 
to use this expression in terms of power sums as the working algebraic definition 
of ffioment functions. For example the function h, which is defined by ft. A. 
Fisher to be that function of the sample whose expected value is the third 
cumulant (half invariant) is to be given the working definition of 

h =_n®_ 3 ( 2 ) u) ^ 2 d) a) (i) 

. (n - 1) (n - 2) (n - 1) (n - 2) r n(n - 1) (n - 2) 
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where the numerical expressions in parentheses indicate power sums of the 
sample. 

Every term in the definition of a sample function has a "weight” which is 
equal to the sum of the power sums whose product is indicated by the term. 
Thus the weight of each of the terms of h is 3. If all the terms of a given 
moment function have the same weight, the function is called isobaric and 
the weight of the function is equal to the weight of each term. Thus k 3 is an 
isobaric moment function and its weight is 3. Since all the functions so far 
proposed are isobaric we limit this generalization of moment functions to iso¬ 
baric moment functions although it is possible that a more complex analysis 
could be worked out for nondsobaric functions. 

Generality demands the inclusion of every possible partition product of 
power sums. Such generality can be obtained by writing 

h - al(l) 

U = Oa(2) 4* flu(I) 2 

/a = Ua(3) + a 21 (2)(l) + am(l)* 

fa ~ 04(4) + o«i(3)(l) + a£a(2)" + 021 *(2)(l) 2 4- au(l) 1 
and in general 

f r = 2 M" (v*Y l • ‘ ■ ip»Y' 

where (pi) ri (ptY 2 ■ • ■ (p,)*' indicates any partition product of power sums, 
app. p*! is its coefficient and the summation is taken for every possible parti¬ 
tion. The number of parts of the partition is p = Sir. It may be assumed, 
without loss of generality, that tho partition is ordered, i.e. 

pi ^ V* ^ Vi = * • ■ £ V* • 


A natural numerical coefficient of each term is the number of ways the r 
units can be collected to form the given partition. This value is given by 


l r 


■1 


ys pf 1 • ■ • pi'/ (pi!)" wr • • • (p* 0" I ir 3 1 »• ■ ir«! 


If we sot 


l r 


••• pi* — 


pV vV, 


Upf! ... 


the definition of f r becomes 


fr = 2 ( T ) a f? 1 "■ pT* (Pi) 

\Pi • • ■ Pf / 


W 


In the present paper the capital letters are used to represent the corresponding 
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functions of the universe as defined by the corresponding power sums of the 
universe. Thus 



represents the corresponding function of the universe. In the case of the 
moment about the mean and the semi-invariant the Greek letters ju and X have 
been used to represent the corresponding function of the universe. In tho 
case of functions whose notation is quite widely established, it is preferable to 
use the conventional notation, but in introducing new functions it appears 
wise to use the relationship between small and capital letters since the corre¬ 
spondence between the English and Greek alphabets is not exactly one to one. 
It should be particularly noticed that this notation does not agree with a pre¬ 
viously accepted scheme of using the small English letter to indicate the function 
whose expected value is indicated by the corresponding Greek letter. In the 
present paper it is not the expected value property which serves as the basis 
of notation but rather the definition of the function in terms of the partition 
products of power sums, 

2. The Working Definition of Moments About a Fixed Point. The sample 
functions defined by 


nh ~ 





> (3) 

nia — — , 
n 



' n 


are obtained from/- by placing 

^ when « » 1, ti s 1 , and pi *= r , 

71 

0 in all other cases. . 



The Greek p* is used to indicate the corresponding function of the universe. 

3. The Working Definition of Moments About the Mean. The moments 
about the mean are defined by 


m[ = - 1 -, 
n 


% 


_ ( 2 ) _ ( 1 ) ( 1 ) 


n 


n 


a } 


ms a 


(3)_3(2K1) 2W 

n n ! *" n s ’ 




_ (4) _ 4(3) (1) , 6(2) (l) 5 

,2 + ' 


3(1)' 


n 


w- 


n* 


n* 
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and in general m r is obtained from/ r by placing 

f 1 • 

- if s *= 1. tti =* 1. and pi = r, 
n 1 

(-1)' 1 . 

—^ if pi > 1, n = 1, s =* 2, and p 2 = 1. 

P T* - J W 

-—--- if pi = us 1 and iri = r. 

n T 

, 0 in all other cases. 

The corresponding moments of the universe are indicated by the conventional n. 
For conciseness moments about the mean are referred to as "moments. 5 ’ 

4, The Working Definition of the Half Invariants. The half invariant 
moment functions of Thiele, as applied to the sample power sums are [see C. C. 
Craig (2, 7-10) and Frisch (12, 20-21)]. 

f _ (1) , _ (2) _ 0)0) , _ (3) _ 3(2) (1) 2(C 3 

1 n ’ 2 % n 2 5 3 n n 2 n 8 

, (4) _ 4(3) (1) _ 3C2) 3 12(2) (l) 2 _ 6(1)_ 4 

1 n n 3 n? n* 


and in general 


, ^ (-ir‘( P - i)i / 

' " ft' 



W" w* 1 • • • «'• 


so that 


OjjP ... jj** 




The corresponding moments of the universe are indicated, after Thiele ‘(l) 
and Craig (2), by X, R. A, Fisher (3) used a while Georgescu (4) used s. 

In the present paper these functions are referred to as "Thiele moments.” 


5. The k Functions of R. A. Fisher, The k statistics of R, A. Fisher are 
defined in terms of the sample power sums by 


k[ = ^ fa = 
n 


( 2 ) 


( 1 )’ 


n — 1 »(n — 1)' 


n(3) _ 3(2) (1)_2(l) a 

“ <» - 1) (n — 2) (ft — 1) C» — 2) » w 

, «(n + 1) (4) 4(n + 1) (s) (1) _ 3(2/ _ 12(2) (l) 1 _ 6(1/ 

** " 1)« (ft - I)® (n - 2)<® i ' (ft - 1)<® ' 
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These values and values for h and h are given by R. A. Fisher (3, 203-4) 
while algebraic methods of attaining them are presented in sections 16, 17. 
They are referred to as Fisher moments. The corresponding functions of the 
universe, if used, would be represented by K r . 


6. The h Function. Just as Fisher introduced a sample function whose 
expected value is a Thiele moment of the universe, so it is possible to introduce 
a function whose expected value is a moment of the universe, Such a function 
is defined by 

• *{-<» (2) w 


n 


n 


1 n (n — 1) J 


Ita = 


n( 3) 


3(2) (1) 


(n - 1) (n - 2) (« - 1) (a - 2) 


+ 


2(l) a 


n 


( 3 ) 


r ^ - 2n + 3) (4) 4(ft 1 - 2 n + 3) (3) (l) 3(2n - 3) (2) 1 

li (n - 1)< 3) 


n 


( 4 ) 


+ 


6(2) (X) 2 3(1) 4 

(n - l)w n«>' 


Methods of obtaining the expansion of this function in terms of power sums 
1 are presented in section 18, The corresponding function of the universe, if it 
Were used, would be represented by Hr. 


7. Other Moment Functions, It is possible to obtain an indefinite number of 
moment functions. For example one might define a function of weight 2 whose 
variance equals fit, (or /4), It is possible by the methods of this paper to 
find expressions for such moments. 

For reference purposes Table I is provided showing the values of a for each 
partition of weight <6 for the functions m', m, l, h, fe, The values of 

t. r ) . 

W'vV pJv 

are also inserted, in the left'hand column, so that it is possible to read from the 
table the values for / = m rt m rj l r , k r when r < 6. 

8, Products of / Functions. The product of two or more isobario functions 
is also iso baric and of weight equal to the sum of the weights of the functions. 
Thus 

Aft = M) + a u (l)(l)](a v (l)] = ^(2) (1) + ai>ai(l) ! 

/*/i = mlmaf + anatd) 4 . 

In multiplying f Tl by f Tl any. term of / ri is of weight and when It is multi¬ 
plied by any term of weight n, the result is a term of weight n + T% , 
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TABLE I 

Coefficients of Products of Power Sums in the Expansion of Different Moment 

Functions 


Numori- 






----- 

cal 


/ 


Jr 



coefli- 

a 

m 

r 

m r 


K 

cient 





1 


1 

aj 

i 

1 

1 

1 

1 


■ 

n 

n 

ft 

n 

ft 

1 


1 

1 

1 

\ 

1 

1 


n 

n 

n 

71 — 1 

n — 1 

1 

an 

0 

-1 

-1 

-1 

-1 



n 2 

ft 2 

n {i) 

ft (2) 

1 

a 3 

1 

1 

1 

n 

n 

n 

n 

ft 

(n - 1)<* 

(ft - 1)< 2 > • 

3 


0 

-1 

-1 

-1 ' 

-1 


ft 2 

ft 2 

(n - I)® 

(n - 1)« 

1 

dm 

0 

2 

2 

2 ' 

2 

ft 3 

n: 3 

ft (3) 


1 


n 

1 

1 

ft(ft + 1) 

ft 2 - 2ft + 3 

n 

n 

n 

(n - 1)«> 

(ft - 1)W 

4 

flai 

0 

-1 

?i 2 

-1 

(n + 1) 

(n - 1)< 3 > 

ft 2 - 2ft + 3 

ft<« 

3 


0 

0 

-1 

-1 

2ft — 3 

an 




n 2 

(ti - 2) (2) 

ftW) 

6 

Will 

0 

1 

2 

2 

i 

ft* 

ft 3 

(ti - 1)« 

(ft - 1)» 



0 

-3 

-6 

-6 

-3 

1 

dim 

n 1 

n* 

n (4) 

ft<*> 

1 



1 

1 

ft 2 (ft + 5) 

ft(ft 2 — 6ft 4-10) 



n 

ft 

(ft - 1)< 4 > 

•w* 

i—1 

1 

5 

d41 



-1 

ft 2 

ft(n + 5) 

(n - 1)<‘> 

ft 2 - 5n + 10 
(n - lj» 

10 

Gw 

0 

0 

-1 . 

n(n — 1) 

ft — 2 

n 1 

(ft - 1)« 

(ft - 1)» 
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TABLE I —Concluded 


Numeri¬ 

cal 

coeffi¬ 

cient 

a 


m r 

l r 

Hr 

hr 

10 

dan 

11 

1 

n 3 


2(n + 2) 

(ft - 1)« 

ft 2 — 4ft + 8 

ft< 8 > 

15 

(hn 

0 



2(?i - 1) 

(ft - 1 )< 4 > 

t (2ft - 4) 

+ ft« B > 

10 

02111 

0 

■-1 

-6 

6 

. 1 

n* 

ft 4 

(re - 1)»> 

(re - 1)«> 

1 

ttmu 

0 

4 

24 

ft 5 

sl 

4 


R. A. Fisher [3, 207) used the product k% as an illustration of the algebraic 
method. The more general ftfi gives 

flh — [tta(3) + 3fl5i(2)(l) + aiii(l) a ) 2 [fla(2) + an(l)(l)] 

= 4^(3) (3) (2) + a 3 0 'i .(3) (3) (X) (1) + flaum^) (2) f (l) 

+ [OaaOijia]] + 2daG4am](3)(2)(l) a + 9Q2iGi(2) a (l) 2 -f- 2a3amGn(3)(l) & 

1 + [ 6 fl 2 iflma 2 + 9(i2iftii](2) 2 (l) 4 + [OflaifliiiGn -f- fl2®in](2)(l) 8 -f a iii fl n(l) e 

which reduces to the value as given by him when the values of a are substituted 
from Table I. 


9. The Expected Value of Any Partition Product. The expected values of 
partition products are well known and are indicated by 

‘ Efa) -* nn' pi 

^(pi)(P2j = ftfiipi+pj H“ ft (ft —* 

^(pi)(ps)(ps) = ftMpi+pj+Pa + n(n - I) [iip 1+fii np 3 + v'pi+ptfjipi + Ppi+Pifi'pi] 

+ n{n — 1 ) (ft — 2) jup^Pp, • 

and in general 


ww ■ ■ ■ (p,y = 2 n<ri 


pV pi' ■ . ■ 
,qV g*' ■ ■ ■ 


?:■ 

gf 


, &i) w • • • (n,)" 


where r — xi,+ Xe + xa + • - * + Xt and 




,«F ql‘ 




XI 


indicates the 


I 
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number of'-ways in which the partition p* 1 pp ... pp can be grouped to 
form the partition qP qP • ■ • ql l . 

The continued application of the result above leads to a large number of 
formulae, In order to make these results accessible I present in Table II the 
expected values of all partition products of weight 58. The essence of the 


table is the evaluation of the expression 


vV vl l '' • 

-jfi „Xi 
,?1 $4 * 1 ‘ 



The numbers 


at the top of each column indicate the subscripts of the ju's which must, of 
course, be multiplied by n [fl . The entries on the extreme left are the numerical 
coefficients associated with each row. 


10, The Expected Values of the / Functions. With the use of Table II one 
is able to write expressions for the expected values of f r when r < 9, 

Mi (/0 = Wi) & aiftpl 

Mi( ft) - = (a? + flu)n/ir+ <*iM n ~ iVi 2 

Mi ifs) = $(fs) — (as + 30si + + 3(fl 21 + < 2 tn)n(ft — l)/iaMi 

+ am n(n — l)(?z — 2 )ju[ a etc. 

If the expected values of the / functions are expressed in terms of the moments 
about the mean of the universe, these formulae become, since pj «= 0 

h!(/i) - 0 

Mi (/a) = (®4 + Un)7iMa 

Mi(/a) = (&a ~b 3 oai + a m)nMa 

Ml(/ 0 — C®4 ~b + 3oa2 -b 60an H" aun)wM4 

-b 3(022 + 20211 ”H aim)w(?t -* 1 )m? etc. 
These may be written more symbolically as 

= o 

’ Ml(/4) “ ?>2WM4 

Mi(/a) = 

MiCA) = bi 7 i{ii -b 3 bvn(n l)fij etc, 

U. The Expected Value of Products of f Functions. The expected value of 
products of f functions may be similarly found. For example 

«(/i) “ E(fi) = E[«s(2) 4- ou(l)*f - o|B(2)’ + 2a,ouE(2)(l)(l) + ouE(l)‘. 





























































weight = 8 
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TABLE II —Continued 

n I n^\ n«H I aW n w J n W) 

6 I 51 | 42 I 83 1411 32l| 2 * 31*1 2 s 1*1 21* l 1 



weight >- 7 



321* 231 | 31* 2 1 1« 21* l 7 
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Table II can now be used by indicating a\ as a multiplier of E(2f > 2a^au as a 
multiplier of I?( 2 )(l)(l) and ah as a multiplier of (l) 4 . Then at once it is 
evident that 

Ms (/a) — (dl 4 204011 4 a\i)nfn 4 4 2 a z a n -f- 3an)ft(ft — 1 )m 2 

— (a 2 4 au) s fifii -p [(a-i 4 Ou ) 2 4 2ah]n(n — l)pl 
1=5 blufii 4 ( 6 8 -j- — 1 )m2< 

Similarly 

M 11 C /3 j fi) = hbzUfib -f- (6A 4 3bnh -f- 0&2i&u)ft(ft — I)m3Ms 

Ms(/a) = 4 " (6621 4 663621)71(71 — 4 “ (63 4 9621)71(71 — 1)^3 

4 {%l 4 6 fi!a)n(« - l)(n - 2 )nl 

etc, 

I 

where 63 — cl 3 4 3ctai 4 Q-m ^ 621 = am 4 anii 6 m — am. The important 
special case 3 are obtained by assigning the proper values to the a's as given 
in Table I. Thus 

Mi(m 2 ) = i [(ft ~ 1) ? M4 4 (ft 2 - 2ft 4 3) (ft — 1 )mz] 
w 

which agrees with the corrected result of “Student" in 1908 ( 8 , 3) and Tchou- 
proff (10,102). Similarly 

lin(m 3) m) = ~ [(ft - l) 2 (ft - 2)w 4 (ft - 1) (n - 2) (7 ? - 5 n 4 10 Wd 
w 

M 2 (W ** i [(« — 1 ) 2 (n — 2 ) 2 jifl 4 (—6 ft 4 15) (ft “ I) (» — 

w 

4 (ft 2 - 2 ft 4 10) (ft ~ 1 ) (ft - 2)V 3 + (9ft 2 - 36ft 4 60) (ft - 1 ) (ft - 2)fi) 

etc. 


In the same way 


t n ^ /14 , (ft 8 — 2ft 4 3 )ju2 

" i(fe) - u + 


Mil(&3i ^s) — -7 4 " 


ft(n — 1) 

Me . (ft 2 ~ 5ft 4 IOW 2 


n 


n(n - 1} 


z n ^ pa t ( —Bft 4 15)w*2 1 ( ?l " — 2ft 4 10)^3 ( (9ft 2 ~ 3 6 ft 4 6 Q) Ms 
= ft + ~ ft (ft -^iy _ + ~ ft ^ - 1 ) + ft(ft — i) (ft — 2 ) 


n(ft — 1 ) 
etc. 
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faijTh) — '— H - (r 1)^1 

n 

1*11(^81 ^2) = - [m* 4 " 

n 

mi(^* i) = - (m® 4 ~ — i)wl 

n 

etc. 


12. The Expected Value of the Products of / Functions in Terms of the 
Thiele Moments of the Universe. The formulae giving the A in term if the 
X'b are 

M 2 = h 

f 

Ma = X 3 

M 4 *= X< “h 3 X 2 

Ms = Xj ~h IOX3X2 

Ms = Xf 16X4X2 4 * 10 Xa 4 ~ I6X2 



where the summation holds for those partitions having no unit parts. See 
the results of Craig ( 2 , 7-11) and Frisch (12, 21). It is at once possible to 
express the moment formulae in terms of the Thiele moments of the universe. 
Thus the general results above become 

lii(fi) ~ & 2 WX 4 4“ [ 3 X 321 4~ (62 4“ 26|])w(7i — l)]Xa 

Mu(/»j h) = 4“ [lObahiU. 4 * (W >2 4" ^ 621 X 2 4“ 6h 21 X J i)n('^ — l)]XaX 2 

m!(/s) = bln\t + [16bl» 4- (96|i 4- 6 bMn(n - l)]XA a 

4~ [lOXjtt 4" (Xf 4~’ 9 X 21 ) 71(71 — l)]Xa 

4“ [156aft 4 - (276s! 4- 18X 8 X 2 i)tt(n — 1) 4~ (9E>at 4- 6 Xlu)7i( 71 —' l)(tj — 2 )]X 2 , 


13. The Thiele Moments of the fs in terms of Thiele Moments, It is 
now possible to reduce to the Thiele moments of the /‘s by means of the usual 
relations ' 

Mfr) - Mr) ~ Att 

Xn(/ru/rj) Mljf/rii /rj) " Mlfl(/ri >;/r 2 ) 

• X3(/ r ) == Ms(/r) “ 3M2(/r)Ml (/r) 4“ SjUl^/r) 


etc. 
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so that the results become 

hi(fi) = 4~ 2 [bln 4~ b\\n{u — 1 )]X 2 

= 63&271X5 + {3[6 3 b 2 n 4- & £ i6 2 n(7i — 1)] 4- 6[b 3 62n 4~ bnbun(n — 1)]}X3X2 
M(fs) — bsJiXg 4- -j- bgbftiityi —* 1)] -f- -f- bh?i(n — 1)] 1 X 4 X 3 

•f 9 [ 6 a *n + btMn - 1 )]X? + { 9 [fo + 25 a 5 21 ft(ft - 1 ) + bln{n - 1 ) + btm l3) ] 
4” 0[baft 4* 31>2in(7i — 1) 4" &i u.n{n — l)(?i — 2)]JXg 

etc. 

The formulae as written are adapted to the partition representation of Part III, 
When the fs are equal to the m ’s we have 


X 2 (^) = 


_ (ft - l)\ . 2(11 - 1 )X 


n 3 


4 - 


n l 


Xn(w 8 m) = — — 1 6(ft ~ 1) (ft - 2)X 3 X2 

} ft 4 ft 3 


w , = fa - I) 2 fa - 2)*X, 9fa - l)fa - 2)V S 

' ft* 


n° 


, 9(ft - 1) (ti — 2) 2 Xa , 6(ft — 1) (ft — 2)Xz 
4-n- \ - 


IT 


W 


etc. 


which are the results as previously given by C. C. Craig (2, 55). In like manner 
when the f T = k T 

Uh) = | + 

ft ft — 1 




X.fa.) = ^ + 7 —- 

ft ft— 1 ft— 1 (ft — 


6 n X 2 


(ft — 1) (ft - 2) 


etc. 


as given by R. A. Fisher [3, 210] while 

Xj(?ft2) — -(X4 4 - 2Xfi) 


Xu(tfti) iftj) ~ —■ (Xg 4 - OXjXj) 
ft 


Xi(ftla) — - (Xa 4“ lSXtXi 4“ 4“ 15Xi) • 


etc. 
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14. Various Formulization of Results. Although different moment functions 
of the universe may be used it is customary to express the results in terms of 
universe moments about a fixed pointj in terms of universe moments, or in 
terms of universe Thiele moments. It is possible to express results in any of 
the nine forms 


v'(fr)\ {moments about a fixed point (/) 

fi(/ r ) > in terms of l moments {(i) 


MWj 


Thiele moments (X) 


where f r represents the isobaric sample moment function of weight r. One 
purpose of such varied formalization is to discover the most compact form 
and also the one best adapted to use in the ease of a normal universe or a uni¬ 
verse whose moments obey some discoverable law. As suggested above Craig 
( 2 ) has shown the relative compactness obtained by using X(m r ) and Thiele 
moments of the universe while It. A. Fisher (3) has shown the great additional 
compactness obtained by taking j T = fc r , 


15. The Application of the Algebraic Method to Aai(/a, /?). Before leaving 
the algebraic method it is perhaps wise to outline the steps in the case of a 
more involved problem. We take the example which R. A. Fisher (3, 207) 
lms used in the case ia which /, = fc,. To find a 21 (/ 3 ^ fa), 

The value of flfo was found in section 8 , To find its expected value it is 
only necessary to enter the coefficients of the different partition products in 
this expansion at the left of the corresponding rows as indicated in Table II. 

The coefficient of any moment partition of the universe is found by multi¬ 
plying each column entry by its corresponding left row entry and then by 
multiplying by n' f) as indicated at the top. Thus the coefficient of mb is 


(a^fls *h ajQu d~ 603821*2 d - 6838518 !! -f~ 2838 m~b Drains d' 2838111811 T- 6858218111 


*t* 6*21821811 + fiflsittmfln ~h 811182 d - 8111811)71 


which after some algebraic work reduces to 


(*a + 3fl 21 + am) 2 (aa H- Cn)ii = blb^n. 

In this manner it is possible to write the result either in terms of universe 
moments about a fixed point or in terms of universe moments, If moments 
are used, one may neglect all column partitions involving unity. 

It should be noted that the g's defining k r as given in Table I can be inserted 
here if desired. If these multipliers are introduced throughout the rows and 
columnar partitions involving unit parts are not used one will arrive at Table I 
of R, A. Fisher [3, 208] though there are some slight typographical errors in. 
his rows for (3 ) 2 (l) z and ( 3 ) ( 2 2 ) ( 1 ), 

Determining all the coefficients in this manner we find after considerable 
algebraic manipulation that 



MOMENTS OF ISOBAIUC MOMENT FUNCTIONS 


37 


J/-i) = 63^2^8 4" [&a^2 + tl&2|ba + 12il3^21&ll 4" 6&362lb2]'n{7l — 1 )jHbM2 

4~ [2b 3 b 2 18b 2 ib 2 -|- 18b|ibn 4" fibabsibz 4- 12b 3 b 2 ibji]7J.(n ~ })/ib fta 

4" [Sbjbn 4- Ob^ibs 4~ 1 Si?2itn 4- 6b3b2iba]n(rt — l)/n 4" [3Gb2i&2 
' 4" 54b 2 ib u 4~ Gb a b2ib 2 4~ 1 2&3&zj&ji 4' ]2&abmbu + 72b2ib|]i&u 

4" IS&mbzjtttn — l)(?i — 2t)fiifi2 + [baba 4~ Gbjbjibg 4~ 12b 3 b 2 ib]| 

4" 27baiba 4~ 90b 2 ibn 4* SGbsibm&a 4~ 72b 2 |biiibn 4- 36biu&ulft(ft “ l)(ft — 2)^2 

4" [^baibz 4- lSbiibn 4" 3Gi>2ib]nbu 4~ Gbinbz 4~ 36binbn]n(w — l)(n — 2)(n. — 3 )ju 2 . 

If/r = hr tiie proper values or b arc inserted and the expression above becomes 
that given by R. A. Fisher (3, 208). For example the coefficient of y\ is 

(9n - 63w 2 + 240n - 420) (n - 3) 

nHn - l)Hn - 2) 

when 


b 2 = 


1 



1 

n(n — 1) 


bn = — 


1 


n (n — l) 1 


bin - 


n(n — 1) (tt — 2)' 


The algebraic results involved in changing the general formula above to 
other functions are too extended to present here. A symbolic means of attaining 
them is included in later sections of the paper. 


Part II. The Determination of Specific / Functions 

16. Functions Determined by the b's. In Part I it was shown how various/ 
functions are defined by giving definite values to the coefficients of the power 
sums. It is the purpose of this part of the paper to show how functions can 
be specified by means of their expected values in terms of moments of the. 
universe. This is essentially the method used by R. A. Fisher in defining his 
b function and it is here extended to other functions. In this case the b's are 
first determined and the a’s arc then found from them. The first moments 
of fit ft, fa were given in section 10. To these we add, as shown by Table II 

Mt(/i) — ( a 4 4“ 4#3i 4~ 3aj2 4~ 602 ii 4~ 0nn)n/u 4' 4(a 3 i 4~ 3a 2 ii 4~ “ l)wi 

4~ 3(^22 4~ 2 02 n 4" Ouu)n(n 1 )mk 4* 4" — l)(n — 2 )^. 2 ^ 3 

+ Ouipi(w — l)(n- — 2)(n — 3)>n 4 


etc. 
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These can be written more symbolically in terms of the b*s 
Mi (A) = Mm! 

m!(A) - 6 z«m* + b iL n(n — 1)^ J 

m!(A) = hiiiis 4- 3bnn(n - + bmn(n - l)(n - 2)fi( 3 

/ij|(A) s= 6^4 + 46 3 in(n — 1 )/ijMl + 3622/1(71 — 1)m 2 2 + 662iin t3) M2Mi 2 4 - 67 i <4 Vi 4 r 


and in general 


l r 


\Vi Pa 


*1 


vl'i 


6j>T 1 


»!• n 


(p) 


(m;, 


The expansion of the function in terms of the power sums of the sample demands 
the determination of the a's. This can be accomplished b} r solving the equations 


a\ — 6j 

fla 4" = 6g 


fl u 


611 


+ 3 an + am — 63 

021 am = 621 
am = 6m 


04 4- 4a JL 4 - 3a42 4“ 6^211 + aim = 64 

1 

a »i 4 " 3 ti 2 u 4 " aim ~ 631 
Om 4 " 2o2ii 4 ~ aim = 622 


The solutions are 

Oi =■ 6] 


etc. 



an *= bn 

03 5=5 63 — 36j* 4 * 2bm 

02 i *= 6 ji — 6 m 


am = bm 

04 = 64 — 46 31 — 3624 126 jh — 661111 

® 3 i = bji 36211 4 - 26 mi 

022 ~ 6?2. 26211 4 ” 6im 

0211 = 6211 - 6in^ 

aim = 6 m i- 



MOMENTS OF rSOBARFC MOMENT FUNCTIONS 


39 


The values of a r , at least for r ^ 4, follow the Jaw 


a r 




(-1 r 1 (p —1) I i>uT 



and 


ou = la*ad where l^ai 1 indicates that = fc 2 — 6 n is multiplied by a t = 6 i, 
the rule of multiplica tion being suffixing of subscripts. Similarly Om =* iaW — 
f (&2 — 6 u) (&2 — 6 ll ) 4 = ^22 — 2b m “b & 11 U- 
This statement illustrates a general theorem which will be established later 
in another paper by a different approach that for all cases 


and that 



(-ir v (p - im P [i 


T 


Vw 


4 


€L j > >■!? 

t u 




This theorem enables one to write, with comparative ease, the coefficient of 
any product of power sums in a sample function whose expected values is defined. 
For example the functional coefficient of (3)(2) in/g is 

laa — I (6a — 3 /jat -j- 26 lu) (62 — 6li) 4 = 632 — 6311 — 36221 "b 56 am — 26 um 

while that of (3)(1)(1) is - 6311 - 36aui + Sbum. If the expected value 

of the function is known the b’s are determined and the values of the above 
expressions can be found by substitution. 


17. The Values of the Fisher Moments (h functions). The fc functions have 
been defined to be these functions whose expected values are the Thiele moments 
of the universe. Thus pi (At-) = X r and since 


^ = 2 


,vl'vV 


vl'j 


••• (f/j'\ 


it follows at once that by comparison with pi(/ r ) in the last section, that 


Thus 


6«fi 


3 * * » ™ 


P] P2 4 Pj 


. (-l)’" 1 (p~ 1)1 


fci = -} b. = i; 

n n 


bu = - 


n 


0) 1 


, 1 ,- 1 , 2 
h m n> bnl = ^'> 


, 1,-1 

04 = - ; O aL = ~^r| 

n v> K ‘ 


‘ -1 , 2 
622 =*^>1 4)211 = ^; 


x -6 * 

61111 " etCl 
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Tlie insertion of these values in the formulae of section 16 gives the values of 
a such as those indicated in Table I and in section 5. Thus the coefficient of 
(3) (2) in U is 

10(632 — 6311 — 36221 + 562111 — 26 nm) = “ 10 + ^3} + ^ 

10n (25 

_ (n - 1) (4) ‘ 


The coefficient of (3) (1) (1) is 

[ 2 18 48 "1 

^(i) + ^i) + 


10 (2n + 4) ■ 
(n — 1 ) (4> ' 


18. The h Functions. It is also possible to define a function whose expected 
value is the moment of the universe. Thus ni(h r ) — p r where 


and 


Hr 




I 


1 if s = 1, Ti = 1, and pi = r. 

(—l) n if pi > 1 , 7Ti = 1, s = 2 and p* = 1 . 
(-l) r_1 (r - 1) if pi = 1 } s = 1, and vi = r, 
, 0 in all other cases. 1 


Comparing with the value of p[(/ r ) in section 16 we have 




*!• - 


v r 


p T 3* 


n 


(a) 


The substitution of these values of b in the results of section 16 gives the expan¬ 
sions of h r in terms of power sums as illustrated by the formulae of section 6 
and Table I. Thus the coefficient of (3) (2) is 

10(632 — ban — 36221 56jm — 26mn) 


10 [° + n»> + ° + nW + ^)] 
Similarly the coefficient of (3)(1)(1) in hi is 


-10(n - 2) 
{n - 1)< 4 > ' 


10 ( 631 . - 36 2 hi + 26mn) = 10 \~ + "4 + 4il = ^ Z ~ + ~ 

L» (,) n -n-C6>J n w 

19, The h Functions. One line of attack calls for the introduction of new 
moment functions which will result in simpler formulae. Thus for example, 
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C. C. Craig wrote (2, 37) "It rather seems that the best hopes of effectively 
further simplifying the problem of sampling for statistical characteristics lie 
either in the discovery of a new kind of symmetric function of all the observa¬ 
tions which may be used to characterize frequency functions and which will 
be more amenable than either moments or semi-invariants for use in sampling 
problems, or in, what may very well prove to be much better and more 
feasible, the abandonment of the method of characterizing frequency functions 
by symmetric functions of all the observations altogether.” 

R. A. Fisher has shown that it is possible to introduce symmetric functions 
which do simplify the resulting formula appreciably. It is the purpose of this 
section to introduce an additional symmetric function which simplifies the 
resulting formulae to a much greater extent. It is admitted that this function 
does not have all the properties (such as invariance with respect to change of 
origin) possessed by the Thiele and Fisher functions, but it does not have the' 
property of making the resulting formulae simple. It also has the advantage 
that n(h' r ) = 

The basic idea is to find a sample moment function whose expected value is 0. 
A first attempt, placing every b = 0, is of no avail since every a is also equal 
to 0 and there is no function. A second attempt is based on the idea of finding 
the function h whose expected value is m . If the universe is assumed to be 
measured about its mean, as is conventional, it follows at once that jui = 0 
and A*i(A r ) = 0 so that 

Hpv(Jlr\i ^rj) = j ^rj ) 1 

This function then has the property that its moments about a fixed point and 
its moments are identical. 

In order to discover its expansion in terms of power sums, we note 

Hiih'r) = Hi 

and it follows at once by comparison with ju(/ r ) in section 16 that by - 

and = 0 in all other cases. The a*s are determined in the usual 

way. Thus 

Oi = t>! - bn = - -~_- T) 

0,1 = 

so that 

- - S5TH5 m - (1)(1)1 ' 
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Similarly 

>iS = X\m - 3(2 ^ 1 )+ufl 

n (3) ■ 

i 

ft! = - i 16(4) - 8(3)(1) - 3(2) (2) + 6(2)(1)(1) - (l) 4 ] 

n w 

and in general 

h' r = (-ir l Kp. -1) ir 1 Kp* -1) r •• • Kp. -1) A" 

(p ,) 11 *« * (p*) T * 

j 

In order to show the simple form in which results can be given we substitute 
the values of the b’s in the results obtained above. Not only does ni(h r ) = 0, 
but by section II 



4 

M2 


= Hzihi) = MaC^a) = n (j l . ~Z I) 

Xn(ha, hi) = Mu(ba, h) — Mii(ha, ha) = 0 


^(ha) — Matta) = ^ 2 (^ 3 ) =» 


6 


n{n — 1) {n — 2) 


3 

M2 


while from section 15 


hz) = nm{h 3l hs) =5 MmOte, J 12 ) = 


36 msM2 


"h 


36(n — 3) M 2 


n 2 {n - l) 2 {n - 2) n 2 (n — 1) 2 (» — 2)' 


1 

It is to be noticed that these formulae contain very few terms and that the 
terms themselves involve very low moments of the universe. This simplicity 
lms been attained without making any assumption such as normality, regarding 
the nature of the universe. 


20. Table of Values of b for Different Functions When r < 6. This process 
of defining functions by means of expected values could be extended indefinitely. 
Perhaps it has been applied to enough functions to suggest the breadth of the 
applicability of the theory developed in Part I and Part III. 

As the b’s are the quantities which are used in the formulae I have provided 
Table III giving their values for the six functions, m T) k f} h r) ti r when 

r - 1j 2 , 3, 4, 6 . When the a’s are known, the b’s are computed from them 
according to the formulae of section 16. 
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• TABLE III 
Tai^es of the b’s for r ^ 5 


NumJ 

OOGf. 

a 

1 

Wi r 

Jr 

K 


t 

h 

r 

1 

fit 

■ 

l 

1 

1 

1 

1 

n 

ft 

n 

ft 

ft 

ft 

i 

h 

i 

n 

ft - 1 
n 1 

ft - 1 
n 2 

1 

n 

1 

ft 

0 

i 

■I 

0 

l" ^ 

i 

r 

1 

1 

IB 

n 2 

ft- 2 

ft< a} 

ft<*> 

n (« 

i 

bs 

' 1 

(n — 1) (n — 2) 

(ft — 1) (n - 2) 

i. 

1 

0 

n 

ft 3 

n 4 

n 

ft 

3 

a 

0 

(n-2) 

u 3 

71—2 

ft 3 " 

i 

ft<*> 

1 

“ *<*) 

0 


t 

0 

2 

2^ 

2 

2 

1 

1 

tom 

71 1 


n ii) 

n (4> 


1 

b> 

1 

(ft - 1) (n* - 3n + 3) 


1 

1 

0 

n. 

ft* 

n' 

ft 

n 

4 

a 

1 0 

(tv* - 3ft 4- 3) 

ft 1 

(ft* — 0» + G) 
ft* 

_1_ 

ft [1> 

i 

i 

0 

3 

i >23 | 

0 

2 n — 3 
' ft* 

(n s — 4 n -f 0 ) 

ft 4 


0 

1 0 

6 

&J11 

0 

7 V — 3 

ft* 

2 (ft - 3 ) 

ft 4 1 

2 

n«> 

_1_ 

ft (3) 

0 

1 

6)111 


_ 3 

6 

0 

3 

_L 

n k 

| ft* 

^ n< 4 > 

ftW> 

n w 

1 

6b 

1 

(ft-l)(tt-2 )(n*-2ft + 2) 

(tv—1 ) (ft—2) (ft 1 —I2n-R2) 

1 

1 

0 


ft 5 

ft 5 

n 

n 

•i 

bu 

i 

(n 4 — 4n* + On — 4 ) 

(ft 1 - 14 ft* + 36 n - 24 ) 

•i 

__ jj 

0 

n 4 

ft 5 


n (1> 


fcfla 

0 

ft* — 4 n + 4 

(ft 3 - Sft* + 24 ti - 24 ) 

IHH 

i^Ba 

0 

n 

ft 6 

ft 5 

1 

u 

10 

6»ii 

0 

tv 1 — 3 rc -|- 4 
i %i 

2ft* - 18 ft + 24 

ft 6 

ft< 4> 

71^) 

0 

16 

bm 

1 

0 

_ 2{n - 2) 
n & 

2ft s - I2n + 24 
n s 

A 

n (s) 

0 

0 


,6im 

0 

— 4 

ft* 

0(n - 4) 

ft 1 

A 

n (<) 

I 

0 

1 

&um 

1 

_ - — - - -- 

4 

ft 4 

24 

ft 4 

_24i 

a 

_L 

Tito 

_ 
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Part III. Combinatory Methods 


21, Partition Representation of Expected Value of / Functions. The formulae 

mI(/i) = 

- b 2 nu 2 4 bnn(n ~ fV ! 2 

g((/ 3 ) = banvs 4 3i>2i?i(ft — l)^i + — l)(n- — 2 )mi 3 

p[(fi) = bnnti4 4 4.bs\n{n — 4 3 b^n(n — 1)m2 2 

4- dbinn(n — l)(n — 2 )4 frim^Vi 4 
are ‘'synthetically” given by the column partitions 


1 

2 1 

1 

3 2 

1 


4 3 

1 


1 

1 

1 

2 2 
2 1 

1 


1 

1 


The partition parts represent both the subscripts of the moments and the 
subscripts of the b’s. If p indicates the number of parts, the n multiplier 
is n (t) . The numerical coefficient is obtained by taking the sum of the entries 
in the column (the weight) and dividing it by the factorials of all entries times 
the factorials of all repeated entries jus indicated by 


r 





r! 


(?>l l)* 1 (p2l) T2 Y • (pi !)*' 7TLI TT<i ! ■ ' • 7rJ 


The translation from the synthetic, partition form to the expanded form is 
accelerated if the coefficients are known. These are provided in the following 
partition representation of tire formula for p[{f r ) when r ^ 8 and the results 
are expressed in terms of the moments of the universe 

mUs): 1 

2 

l 

3 



MOMENTS OF iSOluniC MOMENT FUNCTIONS 


45 


wtt): 1 3 

4 2 
2 

Mi(/b) : 1 10 

5 3 
2 

1 15 10 15 

6 4 3 2 

2 3 2 

2 

m'i(A): 1 21 35 105 

7 5 4 3 

2 3 2 

2 

ul(/ 8 ): 1 28 56 35 210 280 105 

8 6 5 4 4 3 2 

2 ^ 4 2 3 2 

2 2 2 
2 

The proper formula can be stated immediately from its synthetic representa¬ 
tion. Thus for example 

+ 155 w n(n — 1 )/liU 2 + 10Wi(n — I)j4 

+ 15b 2! 2n(n - l)(n — 2)^1 * 

22. Partition Representation of the Expected Value of a Product of f Func¬ 
tions. Two column partitions may be used similarly to represent the expected 
values of the products of two/’s, three column partitions for the expected value 
of the triple product, etc. In order to obtain all terms it is only necessary to 
combine every partition of each / in every possible way. The synthetic repre¬ 
sentation of JHCfUi, mi) is 

112 1 

21 20 11 10 
01 10 10 

01 

The sum of the entries in each row indicates the proper moment while the 
number of rows indicates the number of parts as in the preceding section. 
The n coefficient associated with a p rowed partition is then n ip] . The b coeffi¬ 
cient is indicated by the columnar entries. Thus 

PwUiJx ~ b 2 6 i^!+ [bs&L + 2bnbi]n(n - 1 )m ^1 + b n hn{n - l)fa 2)y.\. 
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4f> 

We verify this by the algebraic method 

Ms*,h) - M2) + ««(D(l)!laiWU 

= J?[«mi(2)(l) + flj]fli(l) 3 ] 

= (kai[nH3 4 n(n — l)/isai 

4- fluftiMa 4 3 n(n — l)^ 2 ^i 4* — l)( n — 2)/ii 3 ] 

= (a 2 4 an)aiftpa 4 { 0.2 + On)fli^(w ~ 1 )^ 2 Mi 

4 2auaiM«f<i 4 aucwCn - l)(n — 2 )m1 3 
=■ bzbinn 3 4* bib t n(n — l)v 2 p'i + 2 bnbin(n ~ 

4- - l)(n - 2 )mI 3 

as indicated, 

It thus appears that the partition representation is a mnemonic device for 
indicating the solution as obtained by the algebraic method. A move formal 
justification is based upon the property that if 

JffCfi) - M2) + 6n(l)(l) and WO - MD 

then can be obtained by a symbolic multiplication of b a (2) 4- &n(l)(l) 

by bi(l) where the b’s are multiplied but the power sums are collected in all 
possible ways. Thus 

*U„/i) - W.[(s) + ( 2 )(i)] + M,[a( 2 )(l) + U) J ] 

which gives 

* 

-E(/sj / 1 ) “ bibitt^S 4 b 5 bi7i(?i - 4 2b n bm(^ — 1 )mbp 1 4 bubi»°V! 3 

os before, 

This symbolic multiplication is generally true and serves as the real algebraic 
justification of the partition representation. It will be established in a later 
paper dealing with the more general case of a finite population. The general 
type of partition analysis has been used previously by Fisher (3) and Georgescu 
(4), Each has established it through analytic rather than algebraic means. 

23. Determination of the Coefficients. Methods of determining the numerical 
coefficient have previously been given by such authors as Fisher (3), Wishart (5) 
(7) and Georgescu (4). If the/'s are of different weight, the coefficients of any 
partition (an interchange of rows is not looked upon as changing the partition) 
is given by writing in the numerator the factorials of the different r’s and in 
the denominator the factorials of all the different entries and the factorials of 
ail repeated rows. Thus the coefficient of 


4 1 3! 3 ! 
21(10*21 


= 72 . 


210 

111 is 
111 
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In case two or more functions have the same weight additional equivalent 
partitions are formed by interchange of columns. The reader is referred to the 
above papers for rules for determining the coefficients in the more involved 
cases though the coefficients are presented for all the two way partitions of the 
next section. 

An alternative method of finding the coefficients is that given by C, C. 
Craig (2j 24-25) since it appears that the symbolic formulae used in the present 
paper are essentially his formulae for Vs in terms of Vs. For example his for¬ 
mula for j> 4 4 (2, 22) is given symbolically by the formula for 44 in the next 
section. The only difference revealed is that the subscripts' of the X’s are read 
by rows rather than by columns and that they are sometimes interchanged. 
The more precise formulization is needed for the present interpretation although 
it is not needed' for Prof. Craig's purpose. 

A third method utilizes the symbolic multiplication process stated in sec¬ 
tion 22, Subscripts of the b's are used to indicate which power sums are col¬ 
lected. Thus [ba(2) + bn(i)(l)] 2 gives 

baba(4) 4~ bsobcft(2)(2) 2[2baobn (3)(l) -f baocbon{2)(l)(l)] + 2bnbn(2)(2) 

~f 4 &hd&i[h( 2){1)(1) + buDoboon(l)(l)(l)(l ) 

where the underscored terms indicate the products given by [b 2 (2)] 2 , 2[b 2 (2)] 
[bn(l)(l)], and [bu(l)(l)f respectively. This is represented by 


1 

1 

4 

2 

2 

4 

1 

22 

20 

21 

20 

11 

11 

10 


02 

01 

01 

11 

10 

10 




0.1 


01 

01 







01 


The underscored terms are the only ones remaining when = 0. 

This method is especially useful when a large number of formulae are to be 
computed, as in the next section. 

24. The Partition Representation of Formulae of Total Weight g 8. The 

partition representation of fn(/r) when r ^ 8 are given in section 21. The 
partition representation of the remaining formulae of total weight ^ 8, which 


do not contain 

unit parts, 

are given below 

22 1 

1 

2 


22 

20 

11 



02 

11 


32 1 

1 

3 

6 

32 

30 

12 

21 


02 

20 

11 
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42 

1 

1 

8 

6 

4 

6 

3 

12 







42 

40 

31 

22 

30 

21 

20 

20 ' 








02 

11 

20 

12 

21 

20 

11 













02 

11 






33 

1 

6 

9 

1 

9 

9 

6 








33 

31 

22 

30 

21 

20 

11 









02 

11 

03 

12 

11 

11 













02 

11 







222 

1 

3 

12 

6 

4 

1 

6 

8 







222 

220 

211 

201 

111 

200 

200 

110 








002 

Oil 

021 

111 

020 

Oil 

Oil 












002 

on 

101 






62 

1 

1 

10 

10 

6 

10 

20 

10 

20 

16 

60 




52 

50 

41 

32 

40 

22 

31 

30 

30 

12 

21 





02 

11 

20 

12 

30 

21 

20 

11 

20 

20 











02 

11 

20 

11 



43 

1 

3 

12 

e 

1 

4 

12 

18 

12 

3 

18 

36 

36 


43 

41 

32 

23 

40 

13 

31 

22 

30 

03 

21 

12 

21 



02 

11 

20 

03 

30 

12 

21 

11 

20 

20 

20 

11 








- 


02 

20 

02 

11 

11 

322 

1 

2 

4 

12 

3 

1 

4 

6 

12 

12 





322 

320 

311 

221 

122 

022 

301 

220 

121 

211 






002 

Oil 

101 

200 

300 

021 

102 

201 

111 





1 

2 

6 

12 

12 

12 

24 

12 

24 






300 

300 

102 

021 

201 

111 

210 

120 

111 






020 

Oil 

020 

101 

020 

Oil 

101 

101 

101 






002 

Oil 

200 

200 

101 

200 

Oil 

101 

no 





62 

1 

1 

12 

16 

6 

30 

20 

16 

20 






62 

60 

51 

42 

50 

41 

32 

40 

31 







02 

11 

20 

12 

21 

30 

22 

31 






16 

30 

120 

46 

10 

60 

120 

90 


16 

90 




40 

40 

31 

22 

30 

30 

30 

21 


20 

20 




20 

11 

20 

20 

30 

12 

21 

21 


20 

20 




02 

11 

11 

20 

02 

20 

11 

20 


20 

11 




02 11 
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53 


44 


422 


1 

3 

16 

10 

1 

15 

30 

10 

6 

30 



53 

51 

42 

33 

50 

41 

32 

23 

40 

31 



■ 

02 

11 

20 

03 

12 

21' 

30 

13 

22 



16 

60 

90 

16 

30 

10 

30 

60 

90 

90 

46 

60 

40 

31 

22 

13 

31 

30 

30 

30 

12 

21 

20 

20 

11 

11 

20 

20 

20 

03 

21 

12 

21 

21 

20 

11 

02 

11 

11 

20 

02 

20 

02 

11 

20 

11 

11 

11 











02 

11 

1 

12 

16 

8 

48 

1 

16 

18 





44, 

42 

33 

41 

32 

40 

31 

22 






02 

11 

03 

12 

04 

13 

22 





6 

96 

36 

72 

48 

16 

72 

144 

9 

72 

24 


40 

31 

22 

22 

30 

30 

21 

21 

20 

20 

11 


02 

11 

20 

11 

12 

03 

21 

12 

20 

11 

11 


02 

02 

02 

11 

02 

11 

02 

11 

02 

11 

11 










02 

02 

11 


1 

2 

4 

16 

6 

4 

8 

4 

24 

16 



422 

420 

411 

321 

222 

401 

320 

122 

212 

311 




002 

Oil 

101 

200 

021 

102 

300 

210 

111 



1 

16 

6 

12 









400 

310 

220 

211 









022 

112 

202 

211 









1 

2 

16 

32 

12 

3 

24 

24 

48 

48 



400 

400 

310 

310 

202 

022 

211 

220 

211 

121 



020 

Oil 

110 

101 

200 

200 

200 

101 

101 

200 



002 

011 

002 

Oil 

020 

200 

Oil 

101 

110 

101 



8 

16 

12 

24 

12 

16 

48 

96 

24 

24 



300 

300 

210 

021 

120 

300' 

201 

210 

111 

210 



120 

021 

210 

201 

102 

111 

120 

111 

111 

201 



002 

101 

002 

200 

200 

Oil 

101 

101 

200 

Oil 



3 

24 

6 

48 

24 








200 

200 

200 

200 

110 








200 

, no 

200 

110 

110 





1 



020 

110 

Oil 

101 

101 








002 

002 

Oil 

Oil 

101 










50 
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332 

1 

1 

9 

12 

6 

2 

18 

18 

6 

12 



332 

330 

222 

321 

312 

302 

212 

221 

320 

311 




002 

110 

on 

020 

030 

120 

111 

012 

021 



2 

9 

18 

6 



i 






301 

220 

211 

310 









031 

112 

121 

022 








i 

9 

18 

6 

12 

12 

18 

9 

72 

18 

36 



220 

220 

310 

301 

310 

202 

112 

211 

112 

211 



110 

101 

020 

020 

Oil 

110 

200 

110 

no 

101 



002 

Oil 

002 

Oil 

Oil 

020 

020 

Oil 

110 

020 



1 

6 

12 

9 

18 

36 

36 

18 

36 

72 

36 


300 

300 

300 

210 

210 

210 

201 

201 

210 

210 

111 


030 

012 

021 

120 

102 

012 

111 

021 

101 

111 

in 


002 

020 

Oil 

002 

020 

110 

020 

no 

021 

Oil 

no 


9 

18 

36 

6 

36 








200 

200 

200 

110 

no 

i 







110 

101 

110 

110 

no 








020 

Oil 

Oil 

no 

101 








002 

020 

Oil 

002 

on 







2222 

J 

1 

4 

24 

24 

32 

3 

24 

8 





2222 

2220 

2211 

2201 

2111 

2200 

2011 

11U 






0002 

0011 

0021 

oni 

0022 

0211 

nu 





6 

12 

48 

96 

48 








2200 

2200 

2011 

2011 

nu 



, 

. 




0020 

0O11 

0011 

0101 

1100 








0002 

0011 

0200 

0110 

0011 








24 

48 

90 

16 

48 

H 16 

32 






2001 

2010 

2100 

0111 

1011 

1011 

0111 






0201 

0201 

0111 

0111 

1110 

0111 

1101 






0020 

0011 

0011 

2000 

0101 

noo 

1010 






1 

12 

32 

12 

48 








2000 

2000 

2000 

1100 

1100 








0200 

0200 

0101 

1100 

0110 








0020 

0011 

0110 

0011 

0011 








0002 

0011 

0011 

0011 

1001 
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25. The Formulae for the Sample Moments about a Fixed Point in Terms 
of the Moments of the Universe, The partitions of section 21 and section 24 
can be immediately interpreted to give the formulae for the moments of the 
sample function. For example 

( fsifi ) *= 6a btfiHh "f (6362 4 - 362162 + 6621611)71(11 — l)na|U2 

and the value of ^21 (/a , ft) as given in section 15 can be read by inspection. 
The value of the 6 's are to be inserted for any specific function. The coeffi¬ 
cient of til in. tlie expansion of aaf/u) is 

(62 “b fibjbii 8611 ) 71(91 ~ l)(?i ~~ 2 ). 


In case /2 = rtii> 62 = 


71 - — 1 . 


n 


2 * 


and 611 


—r- so that the coefficient is 
n 2 


(71 1) (n — 2) (?t, a — 3n z + 9 ti “ 15) 

— 


as indicated previously by Tchouproff ( 10 , 192) and Church (9, 82). 

The partitions of section 21 give the 8 formulae yur.ovj which Tchouproff 
gave (10,155). In this case/ r = ml and every 6 is 0 except those having single 

subscripts and these equal 

The partitions of section 21 give the formulae p t .{JD which were given by 
Tchouproff ( 10 , 186). In this case it is only necessary to take j t = m T and to 
give the b’s the proper values. Tchouproff has arranged his results according 
to decreasing powers of n. As an illustration we derive his result for v \. (») =* 
^i(???. 4 ). From section 21 

Plf/l) — biTl/lti + 362297(91 “ 1)/12 


and from Table II 


so that 



b 4 


(97 — l) (?t 2 -3n + 3) 

ft 4 


and b 22 


2 n - 3 

9 1* 



= V* + - (6^2 “ 4fti) — \ (15^2 - + ~3 (^2 ■" 3/J 4 ) 

Ti n n 


as indicated by him. 

The partitions of section 24 also give formulae which have appeared before. 
For example the partitions 

1 1 
22 20 

02 
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which symbolize the formula 

Ms(/«) “ + (&a + 2bu)tt(Tl ~ 1 )mi 

become 

j4(w 5 ) = - n - 3 " [(n — l)m 4- (n 2 — 2n + 3) *4] 

ft* 5 

which was early derived by "Student" (8, 3) and Tchouproff (10, 192). Simi¬ 
larly the partitions of 222 and 2222 give the formula for and and 

which were given by Tchouproff (10, 192^193) and Church (9, 82), 

Sections 21 and 24 can then be used to write the moments about a fixed 
point of a sample function in terms of the moments of the universe, In the 
case of new functions the b’s must first be determined. Formulae involving 
unit columnar partitions are not included. If the formulae were desired in 
terms of moments about a fixed point of the universe, it would be necessary 
to write in addition all possible partitions. See for example the last formula 
of section 23. 

26. The Formulae For Moments of Any Sample Function in Terms of Mo¬ 
ments of the Universe. The partitions of sections 21 and 24 are also useful in 
writing the formulae for the moments of the sample moments- It is necessary 
to make the usual adjustments in changing from moments about a fixed point 
to moments: 

Mfr) = *i(A) - ,;u) 

Mll(/ri i /.,) /-llC/Vi,/„) **“ j /r,)* 

The particular two way partitions which are involved in this adjustment are 
immediately recognizable. They are the ones which have an entry which is 
the only entry In the row and in the column in which it is. Thus 3 gives 

220 

002 

one of the terms contributing to Anf/s). In addition its coefficient is the 
same, if sign is not considered, as the coefficient of j4(/ s ) m!(/0 in the expansion 
of f*a(/a) in terms of moments of f% . This has to be true since each is the number 
of ways of forming 220. And so in general the remaining function of n accom- 

002 

panying this adjustment is the product of the coefficient associated with 22 
and that associated with 2, The sign is plus when odd numbers of moments 
are multiplied and minus when even numbers of moments are multiplied. 
Hence 3 contributes — 3n 2 ba to the adjustment to moments and the total 
220 
002 

contribution of 3 to the value of ^ 3 (/ 2 ) is 36a[ti(w - 1) - n 2 ] *= More 

220 
002 
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extensive study leads to the following general method of using the formulae of 
section 24. 

A. Write the coefficient of every two way partition according to section 25. 

B. Block off each single entry by drawing a line through its row and column. 
Bor example 

6 

, 22M 

082(1 

AHAO 

r KjXjXTu 

The resulting partitions, 22, 2, 2 are called component parts. 

C. Form new partitions by eliminating component parts one at a time, two 
at a time, three at a time, etc. from the original partition in all possible ways. 

D. Form the coefficient of the resulting parts acceding to the methods of 
section 25. Multiply by (—where s is the number of resulting parts. 
The values of b will not change. 

E. Multiply in addition by s — 1 when the component parts are all taken 
separately, _ 

6 

As an example we find the contribution of the partition 2200 to the value 

0020 

0002 

of It gives 

6bl[n(n — l)(n. — 2) — 3 n 2 (n — 1) + 2tt a ]/n/izJU2 — Wnb^niA. 

Similarly 1 contribi. 

2000 

0200 

0020 

0002 

— 4 nn m -f 6n 3 (n — 1) — 3n*]nl = — 2)^1. 

We use the method in finding the coefficient of a! in the expansion of na( m s). 
We find first the coefficient of juH in the expansion of jUj(A). It is indicated by 
the partitions 


1 

6 

8 

200 

200 

110 

020 

Oil 

Oil 

002 

011 

101 


so that th& coefficient of iA is 

bl[n(n — 1 )(n — 2) — 2n{n — 1) + 2 n z ] + — 1)(« — 2) — ^{n — 1)] 

+ 86un(n. ~ l)(n- — 2) = 62(271) -f- 1 (— 2?i< £ + 2 ri) 

+ 8buit(n — l)(n — 2). 
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Ti — l —I ,, t 2 (n — l)(n s - I2n + 15) 

When h ~ —^r' *md fc n == “^r thlS becomea --- a *» 

previously given by such authors as Tchouproft (10, 194), Church (9, 82), 
Carve? (Richardson) (11,271). 

The general Tchouproff-Church formulae for the third and fourth momenta 
of the variance may he written out in this way as may many other moment 
formulae which have not been printed, 

27. The Thiele Moments of the Sample Function in Terms of the Moments 
of the Universe. It is possible also to write the Thiele moments of the sample 
function in terms of the moments of the universe. The technique is very 
similar to that of the previous section. The basis of the transformation is 
now the formula for Thiele moments in terms of moments about a fixed point 
rather than moments in'terms of moments about &, fixed point. The results 
are the same us those of the last section when a double or a triple product of 
/'s is involved, but they differ with the introduction of a larger number of 
products, The partitions having component parts are broken up into these 
component parts as before but the parts are combined in all possible ways. 
Multipliers are determined as before with the exception that there is a multi¬ 
plication by (—l)*~ l (a -=• 1)1 where s is the number of resultant parts. Thus the 
2000 

term0200contributes- 4nn w — 3n*(n — l} 2 + 12n a (n — 1) — = 

0020. 

0002 

- 6 fc 4 ttjU 2 to the value of X 4 (/s). 


28, The Moments About a Fixed Point of the Sample Function in Terms 
of the Thiele Moments of the Universe. We return to the problem of section 
25, only we wish to express the results in. terms of the Thiele moments of 
the universe. We must use the formulae of section 12, , 



where p,- H 1. 

Thus p r will contribute to all partitions of r and inversely the contributions 
to a given partition are composed only of these terms which are obtained by 
combining the different elements of the partition. Since the numerical coeffi¬ 
cient in the expansion of a r is the number of ways in which the r units can 
be collected to form the partition, it follows at once that the complete X coeffi¬ 
cient can be obtained by grouping the parts of the partition in all possible 
ways, determining the coefficient of each according to the methods of section 25, 
and adding. In this way the formulae of section 21 can be used to give expan¬ 
sions iti terms of partition, moments. For example the representation of /q(/a) 
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1 16 10 16 

6 4 3 2 

2 3 2 

2 

gives at once 

b g n)\e + 15[baft *f b«u( 7 i. — 1 )]XA 2 + lOlfon + b 3 i7i(n — l)]xjj 

+ 15[6eft + 36 4 i7i(?i — 1) + bmn(n — l)(w — 2)]\|. 

The partitions of section 21 can be made to give the formula mi(Q which 
were given by Thiele ( 1 , 45-46), For example the formula for ^(/ 4 ) is indi¬ 
cated by 

1 3 

4 2 

, 2 

, so that 

= b*wX4 ~1" 3[biH T b22?i( , n — 1)]X 2 

and since 

, (n — 1) {■ n 2 — 6 ft + 6 ) , , 2n — 3 

bi = - 7 --—-—- and 022 7 —- 

rt 4 ft 1 

t tl , (n - 1 ) (n 2 - 6 n + 6)\ 4 6 (n - l)\? 

wW - - IP - IP -’ 

which agrees with the result as given by him ( 1 , 45). 

The two way partitions of section 24 can be used similarly. This device 
for changing to tfye \*s is due to the ingenuity of R. A, Fisher who applied it to 
the case where f r — h r . 

As an illustration we write from section 24 the value of in terms of Vs. 
The partition representation 

112 

22 20 11 
02 11 


gives at once 

& 2 ?iXi "f* [bzft 4* b\n(n — 1)]\1 T 2[6s?i -4- b\in(n — I)]Xs 

which agrees with the result of section 12. The other illustrations of that 
section may be written out similarly. 

As a final illustration of this technique we find the coefficient of X? in the 
expansion of nitifa, fa)- The partitions are 

2 9 IB 6 

301 220 211 310 

031 112 121 022 
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and the coefficient is 

2[blbtfi 4 63 &Ltft(tt — 1 )] 4 9[6aM 4 - blibmin — 1 )] 

4 18(636111 4 6iv6i^i(fi —■ 1 )] 4 6[6s&2?i 4 h$bzibtfi(Ti — I)]. 

If the 6's are inserted to form the h'&, the first and last terms become 0 and the 

others give —This agrees with the value as given by R. A. Fisher 
n{n — Ip 

(3, 208). 

29. The Moments of the Sample Function in Terms of the Thiele Moments 
of the Universe, The partition representations of section 21 and section 24 
can be used similarly to write formulae for the moments of the sample function 
in terms of the Thiele moments of the universe. I t is .only necessary to use the 
general plan of section 2(3, but to write the coefficient of every resulting parti¬ 
tion according to the method of section 28, For example the partition 



gives the coefficient 

t>S(ft 4 4n (Z) 4 4 6 n (s) 4 n (4) ] — ibt[n 4 3n 2 (ft — 1)4 ft S (n — l)(n — 2 )) 

4 GbV 4 n\n - 1 )] - 3 bW - bj[ft 4 - 4ft 4 4 6 n - 3ft 4 ] = 0 . 

30. The Thiele Moments of the Sample Function in Terms of the Thiele. 
Momenta of the Universe, The partition representations of section 21 and 
section 24 can also be interpreted to give the Thiele moments of the sample 
function in terms of the Thiele moments of the universe, The scheme is 
similar to that of section 29 except that the formulae for changing to Thiele 

2000 

moments are used as in section 27. For example the partition 0200 has now 

0020 

0002 

associated with it 

&*(ft 4 4ft (2) 4 3n <3) 4 fift f3) 4 ft <4) ] — *ibi[v? 4 3 n\n — 1) 4 ft 2 (ft ~ l)(n — 2)] 
- 3i{ft*(n - l) 2 4 12&S [ft 3 4 « a (ft ~ 1)] - 66V - 0. 

f 

The application of this method enables one to write the formulae of section 13 
(and others which they typify) with relative ease. It is now possible to com¬ 
plete the task left unfinished in section 15, We do not take the space necessary 
to write all the terms of "SniJs, fa) since the lengthy expression can be obtained 
quite readily from the representation of section 24. One term, say the coeffi¬ 
cient of X(j> 2 i is represented by 
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1 9 12 6 

330 222 321 312 

002 110 011 020 

/** and gives 

9[i>aiw 4* bhbtfi(n — 1)] -f 12[&aM + bzb u bnn(n — 1)] 

+ 6[6s^n + b z bijhn(n — 1)] 

which becomes when b 3 =* h = - and hi = bn = , This 

n(n - 1) n n(n - 1) 

agrees with the result given by R. A. Fisher (3, 209). 

For simplicity of form it is logical to use this fomulization of results, Thiele 
moments in terms of Thiele moments, and it has been used by Thiele (1), 
Craig ( 2 ), Fisher (3) and Gcorgescu (4). They however have used different 
sample moment functions. Thiele and Gcorgescu used the Thiele moments 
of the sample, Craig and Georgescu the moments while Fisher introduced the 
fc function. 

The present discussion deals with the corresponding partition moments of 
any rational integral isoharic moment function of the sample. The results 
indicated here give many of the results of the previous authors as special cases. 
For example the symbolic formula 44 of section 24 gives the WXata) of Thiele 
(1, 45), the Sazin, va) of Craig ( 2 , 57), the *(44) of R. A. Fisher (3, 210 ) as 
special cases when the formula 44 is given the interpretation of this section. 
Some may prefer the Craig attack ( 2 , 21-35) to the partition method, It 
should be noted that the formulae of sections 21 and 24 can he used in place 
of part of the Craig method. Thus his formulae ( 2 , 22 ) 

j'ao = Aho “I" 28 AG 0 A 20 "I” 56 A 60 X 30 -f- etc. 

Vn = X 44 d - (12 X.ioXo 2 15 A 33 A 11 ) stc. 

are immediately obtainable from the symbolic formulae by writing A’s in place 
of b } s and by using row, rather than column, subscripts. It is then necessary 
to compute the values of A*,*, as given by him (2, 16-17, 40) and to insert 
in his expansions of jS* 2 (v„,, v„) in terms of Fs. For example 

$n(v3, Va) => - [vw 4" (n — l)vas — fivicvoi] • (2, 32) 

n 

and from the symbolic formulae of sections 21 and 24 

V&G — Aj 0 4” 10 X 30 X 30 

Vfl 2 — X 32 4" X3aXo2 4" 3 Xi2X20 4“ OXjjXu 

V 30 = Xao 

V 20 — X 20 
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so that 

&i(n 8 , v s ) = i [X$& + (n — L)Xs« + 0 X 30 X 20 + (n — 1)( 6 X 21 X 11 + 3X 2 iX?fl)l (2,30) 

7t 

which agrees with that given by Prof, Craig (aside from an obvious typographical 
error). The insertion of the values of X gives the value as indicated by 
Xu( 7 » a , nh) of section 13 and by the first method of the present section. 

31. Special Rules for the Determination of the Coefficients in the Case of 
the Fisher and Georgescu Analyses. R, A. Fisher (3) gave a number of simple 
rules which assist greatly in the determination Df the coefficients accompanying 
tho partitions. Georgescu (4) also introduced special rules for the evaluation 
of the coefficients of the different partitions he used. It is not to be expected 
that all these rules are applicable in the more general case under present con¬ 
sideration, but the vanishing of such coefficients as that of 2000 leads one to 

0200 

> 0020 

0002 

suspect that there might be some rules which are applicable to this general 
case. A sensible method of procedure is to examine the rules of Fisher and 
Georgescu and determine if they hold in the more general analysis. The special 
rules of R. A. Fisher might be given somewhat as follows. 

A. If a partition has a column with a single entry, that column may be 
eliminated and the factor wT l introduced, 

B. Any partition having a row with a single entry may be neglected, 

C. “We may exclude any partition in which any set of rows is connected 
to its complementary set by a single column only.’' 

D. In determining the algebraic coefficient of a partition the “pattern 5 ' is 
sufficient and precise entries are not needed. Thus the partitions 21 and 35, 

11 42 

although they have different numerical factors, have associated with them the 
same function of n , This value is indicated by the pattern xx which has asso- 

xx 

dated witli it the function . As a result of this property Fisher was able 

to provide a table (3, 223-226) of useful patterns which is of great assistance 
in writing the value of the coefficients, 

E. Formulae of moments of k functions involving fa can be derived from 
corresponding formulae not involving fa., “The effect upon the corresponding 
formula of adding- a new unit part to the partition is ( 1 ) to modify every 
term in the formula by increasing the suffix of one of its k functions by unity 
in every possible way, and ( 2 ) to divide the whole by ( 3 , 206). 

Two of the important Georgescu rules may be stated. 

A / . .The numerator function (aside from numerical coefficient) j s not altered 
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if column?! arc changed to rows and vice versa. Thus the coefficient of s? in 

*S(3 2 ) ~ ~T K T~i T' i ' ancl coefficient of s\ in £(2 3 ) is Georgescu 

(jv -\r 1/ (iv -t-lp 

has replaced n by N + 1. 

B f , All partitions which can be broken up into component parts have coeffi¬ 
cients of 0. This is extended to include all partitions which have as component 
parts other partitions. Thus 


2100 

1100 

0012 

0034 

has a coefficient 0 as does the equivalent 

2010 

1010 

0102 

0304 


' 32. Special Rules for the Determination of the Coefficients in the More 
General Case. In the more general case we have 

A. If a partition has a single column with a single entry, c, that column 
may be eliminated and the value b c inserted as a multiplier. This is imme¬ 
diately evident since the contribution of that column to each term in the 
expansion is b c times its value if the column were eliminated. 

B. The coefficient of any partition having an entry which is the only entry 
in its row and column, is 0. 

This rule, which saves considerable labor in that it makes unnecessary the 
computation of the coefficients of many of the partitions of section 24, is estab¬ 
lished in this way. Without loss of generality the partition may be repre¬ 
sented by 

Cn C \j Cj.3 • ■ ■ Cid 0 

cai C22 C23 • ■ • Cju 0 

iTu+i.fl+i = C31 C32 C33 * ■« Ca v 0 


C U 1 CuS C u 3 * • 1 C mi 0 

' 1 

0 0 0 0 Cu+i, 

/ 

and 7 r u may represent the partition containing the first u rows and the first v 
columns. We determine the coefficient of nvn^+i in terms of the coefficient 
of ir u .v* Consider first any grouping of the u rows of r u l vinto w rows. There 
will be w corresponding groupings of tV+ i in which the last row is added, in 
turn, to each of the w rows and another w -f- 1 rowed term in which it is not 
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added. In each of the first w cases the coefficient by rule A is multiplied by 
b C|J+ i, fi+ i. In the case of the w 1 rowed partition the coefficient is multi¬ 
plied by b* tllv+1 , and n tu,) is replaced by n (l,,+1) . A final adjustment takes 
care of the transition from the moment about a fixed point of the sample 
function to the Thiele moment of the sample function. This adjustment de¬ 
mands the multiplication of the coefficient of ir«,« by b„ u+ i, „ +l n and the sub¬ 
traction from the sum of the other terms* If B u is the coefficient of the w 
rowed form, it follows at once that the corresponding coefficient is 

B„b' +1 , n i [W"> + n iw+l1 - n n M ] = 0. 

This holds for the expansion of any terni of tt,, , v and hence the coefficient of 
tti,+i , D-n is 0. Of course the argument holds if the partition has more than 2 
component parts. 

It thus appears that this rule holds not only for k F and w r as Fisher and 
Gcorgescu have noted, but for 

C. The coefficient of any partition which can be broken into component 
parts is 0, In this sense a component part is any group of rows or columns 
which have no entry in common with any other group of rows or columns. 
It corresponds in matrix language to a matrix which results when one matrix 
is zero bordered by another matrix although rows and columns may thereafter 
be interchanged. 

The proof of this more general case follows the general line of the simpler 
case although the reasoning is more complicated. For example the coefficient of * 


Cii 

C)2 • • ‘ 

Civ 

0 

0 

C21 

C21 ’ • ■ 

Ci„ 

0 

0 

C3L 

Cj2 1 1 • 

C 3 v 

0 

0 

Cui 

C,(2 ■ 1 1 

Ciiu 

0 

0 

0 

0 

0 

Cu-n,v+i 

, 13 - 1-2 

0 

0 * * ■ 

0 

C«-f2, v+1 

c tM-2 ,14-2 

is 0 since any w rowed term of the ir u , „ contributes 

, p+i- 3 -tfu+a * m-i * 


v + 2 

[wn M + n 

f ” +,) - w 


i v+i^u+j i b+j ^ c «+i« »+!*u+i . *+2 [wOo 1 1) ^ ^ -j- 

— n(n — I) = 0. 

Other special rules of Fisher and Georgescu do not hold in the general case. 
Thus Fisher rule B is not generally true since*the partitions 

12 and 22 

30 20 
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have respective algebraic coefficients of bib 2 n -j- b 3 ihn(n — 1) and 

bibiii -j- b^bni^n — 1) 

and these are not in general equal to 0. 

The Fisher rule C is replaced by the somewhat less general C of the present 
section, 

The Fisher rule D is not applicable in the general case. The Fisher rule D 
is applicable in all cases in which the value of the ... is completely deter¬ 
mined by the number of parts for in this case,the particular value of each 
part is not pertinent. We may say then that the Fisher rule I) is applicable 
to all cases in which ... 3 ,^ is a function of p, n where p is the number 


of parts. This condition is satisfied by b p * 


_ (-l)'(p-Ql 

l J ■ * n . * “ v 


and the 


w (p> 

coefficients are worked out for it in Fisher’s paper. The same method is 
applicable to oilier functions satisfying the general condition although the 
values of the coefficients will of course vary with the definition of b. 

The Fisher rule E is not applicable to the general case. Its validity, from 
an algebraic standpoint, depends upon the Fisher properly B which is not 
generally applicable. The Fisher rule E as applied to the more general case 
gives correct terms but it does not give all the terms. For example the Fisher 
rule E applied to ta(fa) gives 

UM = X t + ~4 

n n — 1 

Mh ; k,) ~ I + 

The application of a corresponding rule to 

M/a) = bln'Ki -f- 2 (bln -f- &uw(tt ~ 1)]X2 

would give 

k2t(/a,/i) = b^hifiXs 4~ 4;[f)2&va 4" ™ l)|X;vXa 

while the correct result is indicated by 


1 

221 


4 

210 

Oil 


2 

201 

020 


4 

111 

110 


and is 


Xai(/2/i) “ -j- 4" l)]Xa\2 4- 2[b2bift 4~ b\biV/{n — 

4~ 4" &iiM(n •— 1)]XA2< 
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The difference is due to the vanishing of the two middle terms in the case of 
the k functions, 

The rule B', which Georgescu found most useful in computing and checking 
his formulae, is. not generally true. It is not even true in the case of the le 
function, as can be discovered by using it on the list given by R. A. Fisher 
( 3 , 210 ). It is interesting to note that the Georgescu method, while not being 
able to utilize many of the special rules of the Fisher method, does use this rule 
which is not in general adaptable to the Fisher method. 

33, Special Rules in the Case of the k 1 Functions. Special rules can be 
worked out for other sample functions. As an illustration we examine the 

function li f which was defined in section 19. It is recalled that &ip — ~ and 

that bjjp 1 ,.. = 0 for all other cases, It follows at once that 
A. Any partition having any entry other than unity (or zero) may be 
neglected. 

. B. The value of V is -L. 

n {p) 

As an illustration we write the value ha). From the partitions of 

section 24 we select 


36 


36 

111 


110 

111 

and 

110 

110 


101 



011 


as being the only partitions making a contribution. The result of section 19 
follows at once. 

34. The Case of a Normal Universe. A normal universe is characterized by 
the relationship that \ = 0 when r > 2 . It follows that it is only necessary 
to compute the coefficients of those partitions giving powers of X 2 . 

Wishart (5) (7) has developed the partition analysis of the fe function in 
the case of a normal parent while Georgescu has studied the corresponding 
m function. It is not the purpose of this section to make extensive study of 
the case of the normal parent but simply to indicate that the results of section 24 
are immediately applicable. As an illustration we write the values of XiC/V), 
X 2 G 2 ), X 3 Q 2 ) and Xi(/ S ) in the case of a normal universe. The terms are given 
successively, by 


1 

2 

8 

48 

2 

11 

110 

1100 


11 

. 011 

0110 



101 

0011 




1001 
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and hence ■ 

Xj {ft) ~ bvriXi 

Hfi) = 2 [ftin + b\ v n{n - l)]x! 

^(/a) — 8(6277 d - 3 & 2 &jiw(?i — 1) d - 6'iitt(w — l)(?l — 2)]Xs 

h \{ fi ) — m\n d~ 66561 — 1) d~ 61177(77 — 1) d - — l)(w — 2) 

+ 26) 1 n(w i — l)(n - 2) -f b\,n{n - l)(n - 2)0 - 3)1X2- 

It is only necessary to substitute the 6's to obtain the results for different Values 
of /. This is done in Table IV, 

' TABLE IV 


The first four Thiele moments of fi for various sample functions in the case of a 

normal universe 


Sample 

func¬ 

tion 

Mfi) 

hi (ft) 

, 

1 

Xs(/i) 

M/.) 

Wj 

(n - 1} X. 
n 

1 2(n-l), ! 

, n* 

8(n - 1) 

«* Xl 

48 (n - 1 ) Xs 
n 4 


x 2 

| 2x5 

8X2 

48 X 2 

, n — 1 

(71 - l) a 

(n - l) 3 

h 

n 

■ 2(n — 1) %2 

« A& 

! n 2 

8(rt - 1 )X 2 

1 n3 

48 (?i“l)X 2 

n 4 

1 

^2 

2 X 5 

n 

8X| 
n 3 

48 X 4 

n 4 

hi 

\ n 

2\a 

8X2 

48 X 5 

A 2 

n — 1 

(»- i) s 

(?7 — l) 3 

‘ hi 

0 

2X2 

8 (ft - 2 )X 2 

48 (if - 3 ft + 3 )Xj 

ft2 

n(n — 1) 

n 2 (n - 1)* 

7 l 3 (?7 — l) 3 


One surmises that the general value of 

X r (/ 2 ) is 2 r_1 (r - 11000-.. 0 

01100 ... 0 

00110 . • • 0 


00000 ... n 
10000 ... 01 
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where B represents the b coefficient of the r rowed partition, 
appears consistent with the fact that 




2 r rU r E H 

(n - l) r 


This induction 


as shown by John Wishart (7). The whole subject of the Thiele momenta of 
the general function in the ease of a normal universe would make an interesting 
subject of investigation. 


35, Summary and Conclusion. The contributions of this paper include 

1. The definitions of specific moment functions in terms of power sums, 

2. The use of indeterminate multipliers in representing a general isobaric 
moment function. 

3. The finding of the expected value of products of these functions by alge¬ 
braic methods. 

4. The use of tables in writing these expected values in terms of moments 
(or of moments about ft fixed point) of the universe. 

5. The finding of the expected values of specific moment functions by sub¬ 
stitution. 

6. Means of establishing the expansion of new moment functions which are 
defined by their expected values. 

7. The introduction of the sample function of weight r whose expected 
value is ji r . 

8. The introduction of the sample function of weight r whose expected 
value is 

9. The two way partition formulae of weight g 8 which do not involve 
unit parts, 

The use of these partition formulae in writing: 

10. The moments about a fixed point of f t in terms of moments. 

11. The moments of / r in terms of moments. 

12. The Thiele moments of f T in terms of moments. 

13. The moments about a fixed point of f T in terms of Thiele moments. 

14. The moments of f r in terms of Thiele moments. 

15. The Thiele moments of j T in terms of Thiele moments. 

16. Special rules in the case of Thiele moments. 

17. The applicability of these results to a given sample moment function 
and hence the derivation of varied results, of such authors as Thiele, Tchouproff, 
Church, Fisher, Craig, and Georgescu, from the same partition formulae. 

18. The simplicity of the formulae when h r is used as the sample function. 

19. The application of the synthetic formulae to the Craig method. 

20. The applicability of the theory to a normal universe. 

The introduction of such general procedure opens up a wide field for future 
study, It is impossible in a single paper dealing with so broad a subject to do 
more than to outline the general scheme by which two way partitions can be 
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used as a central formulization of the various formulae for moments of moments. 
More detailed proofs and more extensive analysis of the more important of the 
special cases will undoubtedly be supplied by later writers. 

In later papers the author will show how the partition representation can 
be used in the case of multivariate distributions and how it can also be used, 
in connection with the sampling polynomials introduced by H. C. Carver (11), 
to represent the more complex formulae obtained in the case of finite sampling. 

It is obvious that the author is indebted to the classical moment studies of 
Fisher and Craig. He also wishes to acknowledge his indebtedness to Prof. 
Craig and to Prof. Carver who have read the manuscript and have made 
valuable suggestions. 

The University of Michigan. 
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A COEFFICIENT OF CORRELATION BETWEEN SCHOLARSHIP 

AND SALARIES 

INTRODUCTION 

r 

Some might doubt that it is correct to apply ft coefficient of correlation to 
show the relationship between scholarship and salaries. This coefficient can 
be trusted to give at least a rough approximation, which is all that is necessary 
in the inexact science of vocation. It is fictitious accuracy to be too finical 
in the application of formulas. Therefore, a coefficient of correlation between 
scholarship and salaries is a valuable part of human knowledge. 

Would it be worth while to find this coefficient if it is based upon the experi¬ 
ence of the American Telegraph and Telephone Company? Since the employ¬ 
ment practices of this company are not representative of the employment 
practices of business at large, one might doubt the validity of drawing general 
conclusions from such specialized data. The coefficient for business at large 
is probably less than the coefficient for the Bell System; the value of this knowl¬ 
edge is enhanced if we know the latter coefficient. Since this company is very 
large, a coefficient between scholarship and salaries would be, valuable, even if 
this coefficient applies only to the Bell System and to other companies having 
approximately the same employment practices. 

An article 1 by Mr. Walter S. Gifford, President of the Bell System, contains 
a discussion of some of the relationships between scholarship and salaries. 
President Gifford, however, did not determine in the case of the Bell System a 
coefficient of correlation between scholarship and salaries. 

The purpose of this article is not a new contribution to statistical method, 
but is an application of the method 4 of finding the coefficient of correlation 
when the two variables have not been quantitatively measured. This method 
will be applied to the chart on, page 672 of President Gifford’s article, in order 
to determine for the Bell System the coefficient of correlation between scholar¬ 
ship and salaries. 


FINDING THE COEFFICIENT OF CORRELATION 

An explanation of the chart. It is based on the experience of 2,144 Bell 
System employees over five years out of college. First, assume these employees 


1 H entitled “Does Business Want Scholars?” and was printed in the May 1928 issue 
of Harper'B Magazine, 

4 It can be found in Elderlon’s “Frequency Curves and Correlation. ” 

GO 
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are grouped according to their grades in college. In the high scholarship group 
put those who graduated in the highest third of their classes. The middle 
and low scholarship groups are formed in like manner. Secondly, suppose the 
same employees are divided into three equal groups according to their salaries. 
Then, the salary of any one of the employees would be high, middle, or low. 

Assume a hypothetical group of 300 employees who are college graduates. 
Suppose that the scholarship of 100 of them was high, that the scholarship of 
100 of them was middle, and that the scholarship of the others was low. Also 
assume that the salary experience of these 300 employees is .the same as that 
of the 2,144 employees of the Bell System. 

The 300 employees can be grouped according to the following table. 


TABLE NO. 1 


Salary 

Scholarship 

Totals 

Low 

Middle 

High 

High. 

22 

24 

48 

94 

Middle. 

31 

39 

27 

97 

Low. 

47 

37 

25 

109 

Totals. 

100 

100 

- - - 

100 

300 


This table can be combined as follows. 


TABLE NO. 2 
% 



i 

Scholarship 

Salary 




Low & Middle 

High 

High 

C 

d 

Middle & Low 

i 

a 

b 


Then, c = 46, a = 154, d = 48, and b = 52. Assume N = 300. 

Assume a; is a function of grades received in college. Suppose y is a function 
of salaries received. Assume that the frequencies x and y both follow the 
normal curve of error whose standard deviation is equal to one. Also assume 
that the average of $ and the average of y are both equal to zero. It is a 
matter of common knowledge that salaries are not arranged in a symmetrical 
fashion; y is not a linear function of salaries. 

In the formulas which follow, r is the symbol for the coefficient of correlation. 
These formulas are applied to Tabic No. 2. We have 

—j= [ <T** ! dx = — — — ^ — — = .167, and h = ,4316. 

V2v Jo 2 N 
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Also 



(g + 6) - (c + _ _ 187 and k = ,4874. 

2N 


Then, 


H - —U cf 1 *’ = .3635, and K - - 


.3543. 


V2T .~~ 

All the quantities except r in the following approximate equation are known: 




+ K7 — 3)#(k 2 — 3) 4- rwe (^ 4 — ^ 4“ 3) (&* — 4" 3). 

24 1*0 


Therefore, 


.0261/ + ,0681/ + .1034/ + .1062/ + r - .4314 « 0, 


Then, r is approximately equal to .4061. Consequently, for practical purposes 
we can assume that r = .4. 

28 Booby Street John L. Roheht3 

Brunswicb:, Mum; 


NOTE ON THE DERIVATION OF THE MULTIPLE CORRELATION 

COEFFICIENT 

Consider N observed values of each of n variables. These n-N values may 
be tabulated in a double-entry table as follows: 

X u Xu Xu - ■ Xia 

X n Xn X23 ■ * • Xay 


x nl x nS Xnl * • ■ AnX 

where A<* is the & lh value of the i lh variable. 

Using the i th variable as the dependent variable, the general linear relation¬ 
ship between the n variables may be expressed by 

= iffi -(- ,02 Xz -{- * 1 1 4- iffi-l £i-l + iff i-yi 4- ‘ ■ 4- tffn (I) 

where 


i a i is the general parameter which is to be determined empirically; 
x f = X j At j | 

M) is the arithmetic mean of the j ttl variable. 
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By the method of least squares, the constants of (1) must satisfy the normal 
equations: 

(2x5),ai + (ZxiXa)^ + • ■ • + (Sgifr-Oja- 

-1- (ExiS^.-a,^ + •*■ + (Sxix*)^,, - 2xix< 

(2^1),ai + ( 2 a;I) i(h + • * ■ + (Zxzai^Oiai-i 

4" (2a?2#i + i)fa*Hi + ■ * * + (2XzXn)i®n ~ 2Xz£i 


(2x, - _iXi) 4" (SXf_iX2)ifl2 4" ' ' ’ + ( i —lX„) 2X,_iXi 

(2x i+ iXi)iai + (ZXi + iX S )ia 2 -f ■ ■ • 4- (Sxy+ix,,)^,, = 2sy+ix,- ■> 


(2XnXi),’0i 4“ ( 2 X^X 2 )i&2 4' • - ■ 4- (2Xfi),np — ZXnXy 

where 

(M = z (*<* - Mi) (X,-l - Ml). 

fc"l 

But 

(2x,x,) = NujcriVf, , 

(2x5) - Net = Nuwi ( 2 ) 

where 

r»/is the Pearsanian coefficient of correlation between the i th and j°* variables, 
a ;, the standard deviation of the f th variable. 

Substituting the right members of (2) in the normal equations, we obtain 
the system: 

n 

2 Tik<rm i<ik = 0 
fc =1 


2 r 2k<72<Th %dh = 0 

k»L 


2 1*1-1, k i&k = 0 ( 3 ) 

ft-1 
1 

n 

2 r i‘+J. k Vi+Wk i&k = 0 
*“1 


n 

2 r*kF*v i a * = 0 
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where 


Let 


If 

-1. 


■ • r n i0'n<J'j 

hi *iii 

TlfiO'lO’n 

Fnn&nGit 


(4) 


An be the first minor of the element rj/o-.o 1 / in A } {*4 be A with the and k th 
columns interchanged, and #4,* be the first minor of the element in the i tb 
column and i th row of #4, 

Solving (3) for ^ by Cramer's rule, we find 


*4 

A 


But it can easily be proved that 


«A„ = (-l)'-‘ +1 4 )k ; 


hence 


,. / l\i-fc+I Aih 

&k = t -1; —i • 

4« 

Using eofactors of A instead of minors, we have 

„ ' , lV - k+ i (-1) ,+ *A* 2>« 

<at , ( -l) --- 

} 

Without writing the determinant out in full, we notice that the cr’s can be 
factored out* Hence . 

i ' 


where 


iO'k = “ 


2 2 
Clffz 


2 2 


2 2 


■ ■ ■ c, 


Kik 


!! 




r\ Km 


OiK-ik 

*kKJ 


(5) 


i’u • • * ru 



... ■ 


T ^ 7 * 
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Using these derived values for the coefficients, we may write (1) in the sym¬ 
metric form: 

— (X, - M>) + — (Xi - Mi) + ■' • + — (. X. - M.) = 0, ■ 

ffl (72 , An 

or 



For a multiple correlation coefficient, we use the formula 

AT ■ /K , A \T 

__ i __ >°i L _ V=i __ / J 


R? 


IV 


( 6 ) 


which measures the amount of observed dispersion from the regression plane 
in which X,- is the dependent variable. 

Substituting the values for the a’s, we find 


E\ = 1 - 


A' . 

'ST' /Kill 

U\~ 


l'Clj , Xi2 X$j . 

H-:-r 


(T4 


+ 


tr rt / 


KiiW 


Squaring the bracket expression and using (2) we obtain 



The second sum is the sum of the products of the elements in the fe th row 
by the cofactors of the elements in the row. This sum is necessarily zero 
unless k = i; but if k = i, this sum is equal to K. 


Kh 


R] = 1 - ^ (K i{ K) = 1 


K 



Oregon State A-gricth/iure College 
School op Science 
Corvallis, Oregon 


1 


William J. Kirkkam 



72 


NOTES 


NOTE ON NUMERICAL EVALUATION OF DOUBLE SERIES 1 


1. The Euler-Maclaurin summation formula has been extended to two 
variables by Dr. Sheppard, 2 and Mr. Irwin, 3 to determine cubature formulas. 
A more complicated two-dimensional form was given by Baten 1 involving 
product polynomials, for which a remainder term was also calculated. The 
purpose of this note is to apply the simpler formula to the numerical evaluation 
of double series of positive terms. The method may be extended to multiple 
series of order p > 2. If the double series converges one may sum by rows 
(or columns), using the ordinary sum formula twice. The method is to take 
out a rectangular block of mn terms and then apply the formula to the remaining 
terms. By taking m and n sufficiently large one may cause the series resulting 
from the formula to converge sufficiently rapidly to obtain the sum to the' 
desired number of decimal places. For practical work the prror may be es¬ 
timated because of the asymptotic character of the series involved in the Euler- 
Maclaurin formula. 

Write this in the form 


(1) 


2/« - /'*>* + i/w - m - + nv . zru _ 

f(o) - / V « . r i (a) - f%) , , 1V D / |,,_1> (o) -f-%) . 

30240 + 1209000 " " K ' ’ (2c) 1 + 

If $ —> co one has accordingly in the ordinary case of convergence 

«> t *) - f a * + m - ®® + ■ • ■. 

Now define = £ u(x, y) - [ u(x, y) dy + %u{x, b) - - ^3 4- 

V-b Jb 12 

^ tf ' Y2Q~ “ “ ■ and w(y) - £ u(x, y) = J* u(x, y)dx -f- y) — Ux ^^ - -f 

7/) 


cO oq 


<3—1 6-1 b“l 

720 1 * ’ > then ^3 23 u{x } y ) = ^3 ^3 d(x ) y ) v{x) -f- 53 'wj(jf) 

1 3"“1 V“i £—1 V**l i—L v=i 


v(x)dx + - 


(3) 


*'(1) , s"'(l) 
12 + 720 


■f 

+/■*>!. - wu + «» + *21^® _ + .... 


„v n \ a^l b—1 

30240 ‘ + S S U ^ X> V ^ 


1 Presented to the Society, Nov. 30, 1934. 

1 W. F. Sheppard, "Some Quadrature Formulae," Proc, London Math. Soc., Vol xxxii, 
1900. 

1 0. Irwin, "Tracts for Computers,” No. X, Cambridge Univ. Press, 1923, On Quad¬ 

rature and Cubature. 

1 W, D. Baten, "A Remainder for the Euler-Maclnurin summation formula in two 
independent variables,” Amor. Journal of Math., Vol. 54, 1932, pp. 265-275. 
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fl-l l—l » n—1 

Instead of this one may use 2 2 «(*t, y) + 2 w(y) + £The scheme 

1=1 J/“l i/™! 2=1 

of the double series may be illustrated by a sketch of a quadrant of the ay-plane 
in which the point (x 7 y) represents the term u{x, y). 

Evidently by taking a combination of results from (3) one may evaluate 

quite readily such finite sums as ^ 2 u ( x > y) where q and t arc large. 

I s -p V“ r 

As an illustration of (3) consider 2 £ (# 2 + ^V)" 2 * Here one needs to 
evaluate the integral of the summand. The transformations x = .ay tan 6 
and y — \/t lead to a form which may be integrated by parts. The more 
complicated form 2 2 (ff® 2 + 2&xy + cy 2 y for the case in which s > 3/2, 
a > 1, might be handled by using x = Ifl and approximate integration by 
Simpson’s rule. 

Take as a second example £ £ (x + yY P > p > 2* The case of p = 4 was 
carried out by taking a = b = 10 in (3) and carrying the computation to twelve 
decimals. The series involved converge rapidly and a result was obtained 
which differed by 2 in the 12th place from the true value 0.119 733 669 448 + . 

OQ 

By summing diagonally one may convert this to the simple series 2 + 1)^ H 

1 

OD OC 

or2(*-l)s 4 = £ (s -3 — s -A ). The method of summation diagonally may 
2 1 ' 

be extended to 2 E + a v)~ P t V > 2, a > 0, by the applications of the 
Euler-Maclaurin sum formulas (1), (2) in succession after a triangular array 
of terms have been omitted. 

The form 2 £ aT p y~ a can be written as the product of the single series 

2. Another method of numerical evaluation is the analog of that used for 
single series by the author, 6 Instead of rectangles one has right prisms of 
square or rectangular cross-section. Instead of shifting the rectangles one unit 
to the right to determine upper and lower bounds the prisms are shifted diago¬ 
nally so that they go effectively one unit in each variable. In the case of a square 
base each prism is moved along the 46° line one diagonal unit length. For 
the lower bound instead of trapezoids one uses truncated prisms. For example, 
the prism of Height u n „ is cut by two planes, one determined by the upper 
vertices u mn , u m , n+i * «ii+i,« and the other by the upper vertices %!,„, u m ,*+ 1 , 
Wm+i, n+i of the truncated prism. The surface z ~ «(m, n) passes through 
all the upper corners of the truncated prisms. Each prism is composed of 
two truncated triangular prisms. Now the volume of such a triangular prism is 
the arithmetic mean of its vertical edges multiplied by the area of its base, 


New Method for Finding the Numerical Sum of an Infinite Series," Amer. Math. 
Monthly, vol, XL, No. 9j Nov., 1933, pp. 637-642, 
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Hence the difference in volume between the truncated rectangular prism men¬ 
tioned above and the prism of uniform hight z = u mn can be shown to be 


(4) 


(blifnn ~' "Wrn-j-1, m+1 n 2Wrn,r1+l)/®‘ 


Let us consider series whose corresponding surfaces do not rise above these 
truncated prisms. This sort of ji’uncated prism differs less from the volume 
under the surface than the one formed by the diagonal joining the other pair 
of upper vertices and planes through it for upper faces. The lower bound for 
the remainder is the volume under the surface extending to infinity in the 
7ti and n directions plus the sum of these differences. Accordingly one deter¬ 
mines as the lo.wer bound for the remainder m _i, „_i after summing a rec- 

1 n—1 

tangular array X) «{,; the form 

I™L j‘“»l 


(5) 


(2wjh.i -b 2u],„ 5w m ,n)/6 -j- 5 S u 


t-i 


tn+i, I 


« m 

+ 1 4* Mf,n 

7“1 i-l 


n rui j*r?i 

+ iE««,i+ / / u m , H dmdn + / / u m , n dmdn < R m - 

7-1 J 1 jm Jn J 1 


The upper bound may likewise be given as follows: 

r qo r * to “| 

(6) R m ~l,n-l < S -f T + / / U m , n dmdn - ft £ 1 

Jn-lJm-l *™m J 


where 

(7) 


(8) ft- 


00 n—L » tjt-l 

£ - £ £ uu, t = S £ Wi/, 

*"*ni J-l 7-n f -1 

/•*> a «' 

I I Mm,nd7ndn j I dwidn ^ Wm—i,/ 1 ^ ^ 

__ 1 ^ m ~ L _ Jn Jm _ J— n— 1 i=tn 


'Wnt—1,|»—1 + Vru.n-l 


A n alternate definition of k is 

^ ^ = ^ ^m,n dmdn ~ •Ujn.nJ -r — Wro,n)< 


« 00 


An illustration is afforded by £ £ (m + 1)“ 4 for which fc = ,45614, 

n—1 m-»i 

fc' — ,44586 when m — n = 10 in (8), (9). In this case (5) gave an error of 
”14 X 10“ 6 and (6) an error of 10 -6 . 

S and T may be evaluated by the method published in the Monthly. 6 
One must assume that k increases with m and n. It is evident that for this 


* Loc, cit. 
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method and for the one in the Monthly differentiability is not required but 
only integrability, conditions less restrictive than those required by the Euler- 
Maclaurin summation formulas. It- ia also clear that the method may be 
extended to multiple series of positive terms of multiplicity greater than two. 

Department of Mathematics Chester C. Camp 

University op Nebraska 
Lincoln, Nebraska 
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REPORT OF THE ANNUAL MEETING OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS 

The meeting of the Institute of Mathematical Statistics for 1936 was held in 
Chicago on December 28-30 in connection with the meetings of the American 
Statistical Association and the Econometric Society. 

In addition to the sessions at which voluntary papers were read, a session with 
invited papers was held on the morning of December 30. At the invitation of 
the Program Committee, Professor P. R. Rider presented a paper on "Recent 
Advances in Mathematical Statistics: Factorial Design" and Professor Harold 
Hotelling spoke on "The Analysis of Sets of Correlated Variates.” 

Professor C. C, Craig of the University of Michigan and Professor A. R. Cra- 
thorne of the University of Illinois constituted the Program Committee. 

At the business meeting of the Institute, the following officers were elected 
for the year 1937: President, Dr. W. A. Shewhart; Vice-Presidents, Professors 
P. R. Rider and B. H. Camp; Secretary-Treasurer, Professor A. T. Craig. 

The Institute voted that it would presumably hold its 1937 meeting with the 
American Mathematical Society, 

Allen T. Craig, 
Secretary. 


NOTICE TO SUBSCRIBERS 

Plans are under way to include in the Annals a new section, entitled “Numer¬ 
ical Illustrations of Statistical Methodology.” This new section will be a 
regular feature of the Annals, and will deal with the application of statistical 
technique and theory to the solution of problems in various fields. It is hoped 
that this new section will be of considerable value to those who are primarily 
interested in numerical applications of the more recent theoretical developments 
in mathematical statistics. 

The Editor will welcome contributions to this new section of the Annals. 
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REGRESSION AND CORRELATION EVALUATED BY 
A METHOD OF PARTIAL SUMS 

By Felix Beknstein 

“To be aure, Laplace viewed the matter in a similar way but he selected the 
absolute value of the error as a measure of loss. But if wo mistake not, this 
position is certainly not less arbitrary than our own; that is to say, whether the 
double error is to be considered just as tolerable as, or worse than, the simple 
error twice repeated and whether it is thus more fitting to ascribe to the double 
error only a double weight, or a greater ono, is a question which is neither in 
itself clear nor determinable by mathematical proof but has to be left entirely 
to individual discretion. 

“Furthermore, it cannot be denied that the assumption under discussion 
violates the principle of continuity and precisely for this reason the procedure 
based on it strongly defies analytic treatment while the results to whioh our 
principle leads have the advantage of simplicity ns well as of generality.”— 

F. Q. Gauss: Theoria combinaiionis obsertiationum, pats prior, orh 0. 

Since the “Theoria Combinationis” of C. F. Gauss appeared in the year 1821 
a century of Mathematical Statistics has been dominated by the ideas of this 
classical treatise—ideas whose fertility does not seem to be exhausted even 
today. 1 

The germ of most modern contributions to mathematical statistics—in fact 
also those of Karl Pearson and his school—go back decidedly to this paper. 
Though the immediate achievements of Gauss are so conspicuous as not to 
need any comment, a true critical appreciation of the work can be gained only 
by comparing it with the previous methods of Laplace, superseded by those of 
Gauss. 

For such critical appreciation, C. F. Gauss himself has prepared the ground 
in the lines quoted at the beginning of this article. To Gauss the standard 
deviation is a measure of uncertainty or risk of a game in which the errors of 
observation are considered as causing only losses. In this he follows the lead 
of his great predecessor. The difference between them is that Gaus9 adopts 
the square of the error as a measure of the loss while Laplace adopts its absolute 
value for this purpose. Either choice frees the error from its sign so that the 
loss is the same regardless of the sign of the error. 

Gauss considers this choice of the measure of the loss as purely conventional. 
.Therefore he feels justified in adopting the square of the error because in adopt¬ 
ing the square instead of the absolute value of the error, the mathematics he 
uses remains in the easily accessible domain of analytical processes, This 
creates for these methods a superiority in elegance, simplicity, and generality, 

The modem developments of mathematical statistics, based on the principles 

IT 
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of Gauss, have confirmed the correctness of this viewpoint. This has proved 
true particularly in the theory of analysis of variance developed by R. A. Fisher 
and in the more general theory of semi-invariants, first defined by N. II, Thiele. 

The inadequacy of the Gaussian method seriously impairing its value for 
statistical use has come to light through the investigations of ICarl Pearson of 
distributions of one and two variables. Since the moments of higher order 
involve standard deviations of increasing magnitude the characterization of the 
distributions by means of the moments, in lino with the Gauss-Thiele concepts, 
becomes practically impossible. Therefore it was of the greatest interest that 
Lindeberg was able to derive an expression for the standard deviation of a 
measuro of skewness constructed not on Gaussian but on Laplacian lines, 
namely based exclusively upon the sign of the error. The mathematical diffi¬ 
culties surmounted by Lindeberg by a very involved and difficult analysis— 
with some clearly indicated gaps in the proofs—are precisely of the character 
of those that Gauss wished to avoid. Encouraged by the success of Lindeberg, 
I have developed in two papers 1 the standard deviations of more general mo¬ 
ments and the correlations between them of which the mean deviation of Laplace 
and Lindeberg’s measure of skewness are special cases. The proofs have been 
arrived at by a rather simple and rigorous procedure. These new moments, 
together with the old ones, form a new system of statistical characteristics by 
which a distribution in one or two variables can be described by expressions 
of lower order and therefore of greater precision. This method makes un¬ 
necessary the use of moments of higher order than the third. 

But another point of interest is still involved. It has been assumed that the 
Gaussian characteristics give a greater amount of information than those of 
Laplace. This is proved, however, only for the case of the normal distribution 
fl 1 1 

g-* 1 * 1 This was recognized by Gauss himself in his paper of April, 1816, 

Vvr 

that appeared five years earlier than the Theoria Combinationis Observationum. 
In article 6 of his paper, he says, that the constant h of a normal distribution 
obtained from one hundred observations by the use of ■ the standard error is 
as exact as that obtained from one hundred fourteen observations in which 
the mean deviation is used. Hence with a given number of observations only 
the equivalent of 88% of the total are used by the second method. This does 
not hold true for all distributions. The following theorem can easily be proved: 
The amount of information as defined above, furnished by the use of the mean 
deviation is greater, equal to, or less than that furnished by the standard devi¬ 
ation, depending respectively upon whether 


1 Felix Bernstein: ^Die mittleren Fehlerqundrate und Korrelntionen der Potenzmo- 
mente und ihre Anwendung auf Funktioncn der Potenzmomente/' Metron, Vol, X, N. 3 
(Nov. 1932). 

Felix Bernstein: M Uber don mittleren Fehler der Potenzmomente.Zeitschr, f. d, ges. 
Vera.-WissenBcheft, Bard 30, Heft 3, March 1030. 
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(Pi - 1) | 4(fo - 1) 

where 


e Q = 


jUa 

J 1 


ft 


H 

2 

M2 


I 


/ifc the fc-th moment and i? = the mean deviation. 


For example, in the distribution ~ e * |ir| , the mean deviation furnishes a greater 

£i 


amount of information than the standard deviation, 2 

In the present paper, we shall discuss the practical use of expressions for 
correlation and regression in which the new type of statistics formed along 
Laplacian lines will be used. These new expressions are of a linear form and 
can be computed therefore more easily than those of Karl Pearson. The amount 
of information given by these expressions is less than that given by the expres¬ 
sions of Pearson if the normal law, in two variables, is fulfilled. For other 
distributions, however, this is not generally true. The determination of the 
standard deviations of these new expressions is given in Metron, 3 

The application of the new expressions of regression and correlation to grouped 
data is set forth here for the first time. The method is strongly recommended 
for all cases in which the data lose reliability with increasing deviations from 
the mean. Deviations in the new method enter the expressions only in the 
first degree and not in the second as in the case of Pearson's. It is obvious 
that the influence of the doubtful extreme readings is, therefore, considerably 
lessened! Since our expressions are linear, no adjustments for grouping (Shep¬ 
pard’s corrections) are necessary. 

It ought to be mentioned here that linear expressions for the measurement 
of correlation have been set up before. 

K. Pearson (Biometrika) and Egon Pearson (Biometrika) have derived an 
expression called “linear correlation ratio” which in case of linear regression is 
identical with the correlation coefficient. 

K. Pearson also discusses the linear correlation coefficient 


r — 


l (. g xs ov\ 

2 \ xsgx + ysgy)' 


1 To this second type of distribution curves also belongs y = where ${$) is the mean 
of two Gaussian curves with the same origin, i.e, 

\V t V ir / 

1.6 < k < 3.4. 

I owe this remark and some other valuable suggestions regarding the subject of this 
paper to Mr, Myron Fuchs. 

1 Op. cii . 
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suggested by Lenz and various other linear expressions, all similar to our expres¬ 
sion (1). He finds that they are all equal to his quadratic correlation coefficient 
in the case of a Gaussian distribution. 

However, their expressions were not recommended by those authors for the 
determination of correlation between quantitative variables, because— 

1. No easy and practicable methods were given for their evaluation in the 
case of grouped data. 

2. Their standard deviations were not determined. 

We now proceed to define the new formulas and to describe the methods for 
their evaluation. The proofs are furnished in the Appendix to this paper. 


Let i*i and r$ denote the regression, coefficients of $ on y and y on % respectively, 
and r, as usual, the coefficient of correlation, and by £ and y the arithmetic 
means of the x's and y’s. Let us take x, y as the origin, so that x, y are the 
deviations from the mean, We have 


( 1 ) 


Ti 


n 


jSx 

±v 

Sy 

+v 

Sy 


or ri — 


or r 3 


Sx 
+x 

r = VnxT a 


Sx 

-V , 
Sy 

-y 

Sy 

— 35 

— X 


Sx denotes a partial sum of the x’s, this sum being extended over all the x’s 

+ 1 / 

of the observations whose y is positive and the other sums have a corresponding 
meaning. 

It should be noted though that if data occur whose ^-deviation is 0 (practically 
never in a grouped table) one-half of the sum of these x's should be added to /Sx. 

.... . 

In the S a' similar addition should be made in case observations occur in which x 
-\-x 

is zero. (See Table IV.) 

The formulas (1) and all following ones will be proved in the appendix to this 
article. 4 


4 Using t\ and rs of (1) the regression lineB are y « n% and x » ny. They are those 
straight lines which fit the data best according to the method of least squares, if the weight 
of the deviations iB taken inversely proportional to the absolute value of the variable. 
Taking x for instaneo as the independent variable, r t is the value of m which minimizes 

w*) 1 (the sum extended over all data x y ), 
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The standard deviations of n and ?'2 are 


2 


a- 


ri 



(1 -f m(m — 2r)) 


( 2 ) 


2 


= ^ (1 + n(n - 2r)) 


iSs 

where m — ~ 

Sx 

+y 

Sy 

where n = 

Sy 


We are now going to illustrate the computation of r and for this purpose 
we shall use a table of Pearson’s which gives the correlation between the heights 
of fathers and daughters. 

The totals at the right and lower end of the table are fust computed and 
the bracketed numbers are the sums of the numbers that precede. The 
means are 


1659.5 - 1179 480,5 

1376 ~ + 1376 


1650.9 - 1390 260.5 

1376 " + 1376 


whose signs determine on which side of the working mean to “quarter” the 
table. This quartering is done in Table 1 by the lines vv and hh. Then the 
totals above the heavy horizontal separating line hk and those to the left of 
the vertical separating line vv are found, e.g. 2, 4.5, 7.25, • ■ • and .6, .5, 0, 
Multiplying these totals by the respective class marks, we find the outside lines; 
18, 36, 60.75, • • • and 5.5, 5, 0, 

Sx is now = 1107.5 — 420.5 = 687, and an adjustment for the fact that a 
-y 

working mean has been used has yet to be made. This adjustment is xN _y 
where N~ v is the number of negative y' a. (N~ v — 728.) 

We have therefore for the adjusted values 

Sx = 1107.6 - 420.5 + ^-728 = 825.07 
—y 1376 

Sy = 1179 + ^^-728 = 1433.21 
~y lo/o 

T\ = .5757 n = .5170 

r == .546 

The standard deviations, according to the formulas (2) are 

er r , = .031 tr,, — .027 



Correlation between Heights of fathers and Daughters 
x — Height of Tatters y i Height of Daughters 
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Tforiduff ^Mean x = 67.6 

Class lvidth 1 loch 
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The standard deviation of r 2 = r x X r* has to be estimated by using the 
general formula for the standard deviation of the product c of two variables 
a and b; 


2 

<?o 


2 


a» + ^ + 


2^CTq (J 

ab 


R being the correlation coefficient between a and b. Since -\ < R < + 1, 
substitution of these limits for R leads to the inequalities 

(?-?)’<7 <(; + ?)' 

putting a - ri, b = r 2 , c = r we have 


Vr, 0^2 

ri ?’ 2 r ri ' r s 


Considering the relation o> = 

It 

we have 2r (cr rj ?- 2 — o- ri ri) < <r T < 2r (o- r , + <r rj r t ) 

from which we derive with sufficient approximation 

<r f < ' 030 

A slightly different arrangement for computing r has been made in the 
following table. 

TABLE II 


Correlation bekoeen diameter of the stem and length of the lonest flower petal of 
, Trienlalis europaea* 



PS 

3 

15 

34 

45 

30 

0 

2 

0 

0 

0 

0 


PS 


-4 

-3 

-2 

-1 

0 

1 


3 

4 

5 

0 

Total 

1 

-4 

1 











1 

7 

—3 

1 

4 

1 

1 








7 

29 

— 2 

1 

9 

16 

3 

1 








33 

-1 


2 

9 

22 

9 

2 

1 





45 

27 

0 



8 

19 

20 

4 

1 





62 

8 

1 

1 



7 

18 

.12 

6 

4 




48 

I 

2 




1 

8 

9 

3 

2 

1 



24 


3 






3 

6 

4 

1 



14 

1 

4 







2 

2 

1 

2 


7 


6 









1 

3 


4 


6 









1 


1 

2 

Total 

4 

15 

34 

53 

66 

30 

19 

12 

5 

6 

1 

234 


*E. Czuber: Die statistischen Forschungsmethoden, Wien, 1921. 
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table hi 

x = Diameter of the s(em. 

y = Length of the longest flower petal in millimeters. 
Working mean, a) flI - .825, Vm = 34.5. 


Class width 

of x =.. 

4 mm. of y »* 6 mm. 





Total 

P.S. 


Total 

P.S. 

X 

X 

times x 

V 

times y 

times y 

-4 

16 

12 

-4 

4 

4 

-3 

45 

45 

-*3 

21 

21 

—2 

68 

68 

-2 

60 

58 

-1 

63 

45 

-1 

45 

33 

0 

(182) 

(170) 

0 

(130) 

(116) 

1 

30 

6 

1 

48 

8 

2 

38 

4 

2 

48 

2 

3 

36 

0 

3 

42 


4 

20 

0 

4 

28 


6 

25 

0 

5 

20 


6 

6 

0 

6 

12 



(155) 

(10) 


(198) 

(10) 

Mean 

-27 



+68 



The P.S. columns are the partial sums as explained in the previous table. 
The work of multiplying the totals by the class marks and of adding them has 
been separated here from the table. 

We obtain N « 234, AL* - 100, AL* = 136 

V7 

170 — 10 - ~ X 135 

i-i ---- .805 

130 + g X 135 


116 - 10 + 2L X 106 


182 — 2^4 X 106 
r = .82 

Pearson's coefficient for this table is r — .83. 

Finally we illustrate by a small non-grouped table where the partial sums 
can be written down immediately. 
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TABLE IV 


Correlation between Ages of Husband and Wife 


Age of 
Husband 

Age ol 
Wife 

Deviation 

Husband 

Deviation 

Wife 


22 

18 

-8 

-8 


24 

20 

—6 

-6 


26 

20 

-4 

-6 


26 

24 

— 4 

-2 


27 

22 

-3 

-4 


27 

24 

-3 

—2 


28 

27 

-2 

+ 1 


28 

24 

—2 

-2 


29 

21 

-1 

-5 


30 

25 

0 

-1 


30 

29 

0 

+3 


30 

32 

0 



31 

27 

+1 

+1 


32 

27 

’ +2 

+ 1 


33 

30 

+3 

+4 


34 

27 

+4 



35 

30 

+6 

+4 


35 

31 

+ 6 



36 

30 

+6 

-f-4 


37 

32 

+7 

+6 


Ave 30 

26 





Here O-deviations occur in the third, column. Hence* 

Sy = 26 + £ X 8 = 30, Sx = 33, Sx = 31, Sy = 30, 
+* +* +y +y 

r\ =* . 86 , r 2 =! .91, r = .88 (Pearson's r = . 86 ) 

Appendix 

Proof of formula (1), page 1. The following notations will be used: 
(/(x))° — probable value of f(x) 

— probable value of f(y) for a fixed 

+ 1 

X 1 Sj 

sgx - sign of a; = t—, for x 0 . sgx = 0 if x % 0 , 

I *1 _1 


1 Seq page 7. 
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The assumption of linear regression means that 


( 4 ) yl - = r vi3t (x - ®°) 

We multiply both sides of (4) by some arbitrary function <f>(x) of x and get 

(yl - V a )4>(x) = r v - x (x - x a )4>(x). 


Both sides are functions of x. We shall take their probable values for all x*a. 

Now, for a fixed x, yl<f> (a:) = (y<f>(x))l and the probable value of (y4>(%))l for 
all s's is equal to the total probable value (y<j>(%))\ So we have 

m*))* ~ = W(s - z«)<Kx)) 0 


( 6 ) 


— 


((V - y%ml 

((x - a 0 )^)) 0 


If now we take *V 115 the origin, we get 


_ (y<l>(z)Y 
v ' x (xy>(x))° 

and similarly 

_ (^i(y)T 

* ~ (#i(2/))° 



where is another arbitrary function. 

Replacing the probable values by the respective arithmetic means we get 


( 6 ) 


.. _ SW»U) 

= Sxfa) 


and 


Sx4i(y) 

Sy<h(v) 


with x, g as the origin. 

By a suitable choice of the still arbitrary functions <f> and , we may derive 
all the various expressions for regression coefficients. Taking, for instance, 
'$(*) - x, 4>i(y) = y, we get Pearson’s expressions. Taking ff>(x) — sg(x — <*i), 
4>1 (y) - sg{y - CC 2 ), OCX and a 2 being constants, we have 


(7) 


„ _ Sy sg(x - «i) 

- Sx sg(x - ^' 


9 

and if we make cn *= era = 0 


(3) 


Ty\x 


Sysgx 

Sx&gx’ 


_ Sx sg(y - a 2 ) 
« v ~ Sy eg(y - «.) 


Sx&gy 
** Sysgy 


Since Sx~Sy = 0 , we can. add Sy or Sx to the numerators and denominators. 
Adding Sy to the numerator, Sx to the denominator and multiplying both 
sides of the fraction by $ we get 


ISyj&gix z ai) + 1 ) 

£&E(sg(:c - a-i) + 1) 


0 ) 
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Instead of (9) we can write 


( 10 ) 


8 y+ $S y 

„ _ X > ai X- at 

v:s ~ S x+ %S x 

X > <Xi X = cti. 


since the operations of (9) multiply the y ordinates by 0, 1 according as the 

ai’s are 1= «i. 

The expression (10), with a suitable choice of ai should be used for the purpose 
of numerical calculation of r. For instance, when calculating r from the data 
of Table IV, we took ai = — 0 and had 


Sy -h h S y 
+s x = 0 
Sx 


When dealing with data which are arranged in a grouped table (Tables I 
and II) we take «i equal to the 2 -ordinate of that classline which is nearest to 


the mean 


■( 


In Table I «, = .5 - 


With that choice of a* the sums 


S disappear and the sums S are equivalent to the corresponding sums 
x = ai x > ai 

S, Hence we have 


+* 




Sy 

Sx 

(11) 

r v: * = and similarly 

Sx 

T - +V 

m - Sy 


■J-X 

+y 


Instead of (9) we can also wfite 


(9a) 


- on) - 1) 

“ }&(«(* - oO - 1) 


This leads to 


(1 la) 


Sy Sx 

= and r ’ ;B = 

-X -y 


• It is desirable to chose the absolute values of the a J s small so that the maximum number 
of data enter into the calculation of r. However, to take aj = at ^ 0 would necessitate a 
division of the middle arrays of a grouped table, a laborious process. Hence the ohoice 
of the ft's as described above. 
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Proof of the standard deviations of Formula (2). 

In my article on standard deviations and correlations of moments 7 the stand¬ 
ard deviations of the expressions used in this article have been derived. 

In the following, the notation of the Metron article just referred to will be 
used. We use the symbols: 

P/m.n = 2 (C m sgxy n 
Pmjn = H$ n y n m 
P/m.in = 2 x m sgxy n sgy 

The summations indicated extend over all observations. The true or prob¬ 
able values of the same expressions are indicated by using p instead of P. 

I 

Pi/o 

r x;t/ = ri =* W L 

We derive the standard,deviations by defining the deviations as first variations. 

log n = log P„, - log P an 


fr __ 3 Pi/o ^ iPo/i 
ri ~ pi/c pm 

(12) ct \ = [(Sr,)Y =. (ri)* [ (*£ - ^YT 

L\ Pm Pon ) J 

The probable values of the terms on the right hand side of the last equation are 
derived on pages 17-19 and listed on pages 32-33 of the Metron article referred 
to. The proofs which imply essentially a process of variation of Stieltje's 
integrals will not be given here. From pages 32-33 we take 


(13) 


so that 
(14) 


[(sp,„)*r - ^, rc«vi)T = 

KPwiPt/iT = ?“ - j p i ' ,Wi 

j _ 1 ( '\i i 2^ ~] 

ri W 1 Lpi/o Po/i Pi/oPo/iJ 


Assuming Gaussian distribution, we can put 


v a 


Vm 


* * 
2 pan 


Pn a* rVpciPio = r ^P/ioPo/i 


1 Felix Bernstein; “Die mittloren Fehlercjuadrate utid Korrelationen der Potenzmo- 
mente und ihre Anwendung auf Funktioncn der Potenzmomente,” Metron. Vol. X, N. 3 
Nor. 1932 ). 
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Hence 





1 + ^-2^ 


Vm 


Vm/ 


Replacing the theoretical values by their corresponding empirical values, 
we have 


(16) <r!, + 2rm) where m = ^ S - ^ 

Sx sg y 

The formula for v n has been derived here for the value of ri as given by (8) 

i.e. n = - . In fact, we used n = ^ ^ in the examples in the 

Bytov Sysg{y-a) p 

article, and a had some value absolutely smaller than .5, To use equation (16) 

for the standard deviation of ii is within the limits of the required degree of 

accuracy; hence we shall disregard the difference. In a later paper the standard 

deviation of n for any a will be derived by using the method describecftn the 

Metron article, for a different purpose. 

To prove the statement in the footnote to page 7 

To find the value of r 2 that makes 


Sf(x) (y - r a e) 2 a minimum. 


By differentiating we get 

SJ{x)(y - rtx)x = 0 

Sxj(x)y 

Sxf(x)x 

If ffy = 1 we get Pearson's coefficient, 

If/(&) - n 0) we get 
1*1 

X 

= sq* 

„ x Sxsgx 

Ol.J® 


New York Univhrbitt, 

Departments of Anatomy of the Graduate School and the College of Dentistry. 



METHODS OE OBTAINING PROBABILITY DISTRIBUTIONS' 

By Burton H, Camp 


The emphasis of this paper will be on method. Special results will be cited 
in order to illustrate the methods rather than to summarize achievement in the 
field; for that has been done already by Rider (1930, 1935) Irwin (1935) and 
Shewhart (1933) in recent surveys. The purpose is to describe and to illustrate. 
most of the methods that have been used to determine exact probability dis¬ 
tributions, and to show that they are all derivable from one fundamental theorem. 
In order to prove this unity in a simple manner, it will be desirable to omit from 
consideration methods which are essentially ingenious forms of counting, such 
as are used in sampling without replacements from finite universes, and in 
finding the sampling distribution of a percentile. 

The general problem to he discussed may be stated as follows: N individuals 
(h, * • • , lif) are drawn, one at a time with replacements, from a universe whose 
probability distribution is A certain single valued function of the t 1 s is 
formed. This is called a parameter of the sample, and is frequently also, 
but not necessarily, a useful estimate of the corresponding parameter of the 
universe. The problem is to find its probability distribution,/(a). As usual, 
a probability distribution is a function which is required to be defined, except 
perhaps at a set of measure zero, throughout the infinite domain of its variables; 
it is nowhere negative, and its integral over its domain is unity. 

Most of the more recent developments of the theory relate to a more general 
form of this problem. Instead of N individuals, there are N sets of n individuals 
in each set, and these sets are drawn respectively from M(M £ N) universes, 
each of which is described by a function of n independent variables, thus: 

(1) V"((., 


Instead of a single parameter there are P parameters, and each is a single valued 
function of the observed values of the nN individuals in the sample, thus: 


( 2 ) Xi - g { {t 


(i) 




3 j 


r, 


4”); (i = l, ■■■,?) 


The first method to be described is fundamental and will be designated as 
Theorem I. Let it be required that each g as described in (2) be not only 
single valued but also constant at most in a set of measure zero in + ho wJV-way 
space of the i' s. Then 

(I) " •, *<■) dX = I , i<»>) dT 


1 Presented to the American Mathematical Sooiety at a meeting devoted to expository 
papers on thB theory of statistics, April 11, 1030: 
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where X is the space of x’s and T the space of the i’s, p is any measurable set 
of points in X r and q is the set in T for which g is in p. Often p is the P dimen¬ 
sional cube fa A x, i = 1, • • • P ) at the point (xi, - ■ • , x p ) and then q is 
the set where 


(3) Xi £ Qi ^ Xi + Ax; (i = 1, • ■ ■ , P ) 
and <j> is the simultaneous distribution of the sets of i’s, 

/AS rCl)/j(0 j( 1)\ i (If) /j(J?) 

(4) 0 (Sl I ■ ' 1 » H ) • • * 0 (ti , • • * , t n ')• 

In this 0 C,) is the universe from which the t U) set of £’s is drawn. Obviously, 
if N > M, some of the 0 (,) 's are identical, and then it is assumed that the several 
sets are drawn independently. Often, all Of the N sets of i’s are drawn from 
the same universe. Then M — 1 and all these <£ J s are identical, and (4) becomes 

* - [AiS'V ■ • •, a • • • • • ■, Ol. 

In the special case where there is but one parameter (P = 1) and but one 
individual in the sample (n = N = 1), and p is an interval, formula (I) becomes 

fx+Ax r 

(la) j f(x) dx = <f>dt ; 

and in the very special case where it is also true that q is an interval it becomes 


(lb) 


f(x) = 0(f) - 


(ft 

dx 1 


provided also that certain derivatives (to be specified Later in the proof) exist, 
where t is now the inverse solution of the equation, 


(5) 


® = ff(0- 


The proof of formula (I) is immediate, if one is willing to ossum^the existence 
of the probability distribution /; for then the left side is by definition the prob¬ 
ability that the x's lie in p, and this is also the meaning of the right side of (I). 
(Ia) can be proved without assuming initially the existence of f(x), for then 
the existence of f(x) can be inferred from the existence of the right side of (Ia), 
because f(x) may be set equal (except perhaps at a set of measure zero) to the 
upper right hand derivative, with respect to Ax (Ax is a variable, and x is fixed), 




of |0 (ft, provided that one adds the condition that this derivative is nowhere 


infinite, The point at issue here is merely the existence of a primative for a 
monotone increasing function of Ax. (Ib) may be derived from (Ia) by taking 
the derivative of both sides with respect to Ax, if the derivatives are continuous. 

Theorem I, in these various forms is used a great deal, especially in the last 
form (Ib). This affords one freedom to choose the most desirable function 
for purposes of tabulation. R. A, Fischer’s z distribution, a logarithm, is an 
important illustration. Many authors have been interested in so choosing the 
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function that its distribution shall be normal, They include several of the 
older writers, and more recently H. L. Rietz (1921, 1927), and G. A. Baker 
(1932, 1934). However, the theorem is of special importance in the theory, 
for all the other principal methods of obtaining probability distributions are 
essentially corollaries of it. These corollaries will be called Theorems II, III, 
and IV. 

Theorem II. Let p (the measure of p) and q (the measure of q) be infini¬ 
tesimals of the same order and let both the oscillation of maximum /- 
minimum /) in p and the oscillation of $ in q be infinitesimals; then (I) may be 
written, 

(II) /p = <f>q, 

where / applies to any point of p and to the corresponding point of q. This 

equation (II) is an approximate equation in the sense that differences of higher 
order than those retained are neglected. In particular, with the conditions 
used in formula (la), equation II becomes 


fAx = <f>q , 


The left side of (II) is an approximation to the probability sought. The right 
side shows that, in order to evaluate it, one need only find the volume in T space 
of the differential element q and multiply it by the value of <p in q. Formula (II) 
expresses the so-called geometrical method used by many authors, c.g. f by 
R, A, Fisher (1916, 1926), by Wishart (1928), and by Hotelling (1925, 1927). 
The chief difficulty in connection with it is in finding the volume of ^-dimen¬ 
sional q. In order to display the advantages and disadvantages of this method 
wc shall pause at this point and look at a concrete example. 2 

Let two individual (fi, fa) be drawn independently from a normal universe 
and consider the simultaneous distribution f(x, y ) of the sum, x = k + k, 
and product, y = kU, the mean of the universe being chosen oa the origin. 
Here N — 2, n — 1, M — 1, and so, 


(6) 




<t> - 




1 

27Tff a 


“ 2^5 “ 2 **> 

e 


The point set q is the area lying between the two adjacent hyperbolae, 

kk = y, Ms = y + A y, 
and also between the two adjacent lines, 

k + fa = %, fi + fa “ X + Ax, 

where Aa; and Ay are infinitesimals and are equal. This area may be computed 
by simple integration and is: 


* See nlao G. C, Craig (1936). Craig uses another method to be explained later (formula 
Ilia). 
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__ 2Ax Ay 
Vs 2 — 4 y 
= 0 


Hence II gives us immediately the desired result: 

’ _ 

1 I 

fix, y) AxAy — —- e 


TC(F 


\/ x 1 — Ay 


AxAy , 


= 0 if x 2 < 4 y. 


if x 2 > 4 t/, 
if x 2 < 4i/. 

if x 2 > Ay, 


If x' = Ay, q is an infinitesimal of lower order than p = (Ax) 2 , and so Theorem II 
does not apply. In this case we must go back to Theorem I, and from that we 
can learn that the probability, 



/ dx dy, 


is an infinitesimal of the first order if p = Ax Ay = (Ax) 2 is of the second order. 
Hence it cannot be approximately represented by a finite number times p. 
The oscillation of / in p is infinite. The form of the surface /(x, y) is interesting. 
The ordinates rise to infinity on the contour of the parabola x 2 = Ay, and vanish 
within it, The surface is symmetrical with respect to the plane x = 0, but 
not with respect to the piano y — 0, However, it is clear that the total prob¬ 
ability of any given product, y (i.e, the probability of this y for all possible 
values of x), is the same as the total probability of hence 



and the corresponding formulae, 



and 


V 

2 r 

irtr 2 jo 


3 3 


1 


dx 


7T(T“ JO yV — Ay 

must he equal; both may be reduced to the single form 



(y > 0), 


iy < o)> 


if y 0. 


This is the probability distribution of y. 

With this example before us, let us now reconsider the theory: 

(i) The requirement (in II) that the oscillation of <j> be infinitesimal in q 
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will be satisfied if one can show that may be expressed as a continuous function 
of the parameters (an, Xi, •xp), In our example these parameters were 
x and y and <f> was so expressible (6). But if we had tried initially to find by 
means of (II) the distribution of the product y, independently of what values 
x might have; we should have been stopped at this point, because <j> is not 
expressible in terms of y alone. We should also have been stopped by the 
requirement that q be infinitesimal of order Ay, for q would have been the 
space between two hyperbolas and its area for any fixed (Ay > 0) would have 
been infinite. But, when thus stopped at that first point, it would have been 
clearly indicated to us that the distribution of y might have been found via 
the detour of finding the simultaneous distribution of both x and y, because 
an attempt to express $ in terms of y would have led to the given expression in 
terms of both x and y. For a similar reason R. A. Fisher (1925) was able to 
find the distribution of the variance by finding first the simultaneous distribution 
of the variance and the mean, Also, he was thus able to find the distribution 
of the coefficient of correlation by finding first the simultaneous distribution of 
alhthe first and second order moments. 

(ii) A distinct advantage of this method is that q is independent of the 
universe <j>, so that once found it may be used in connection with any universe 
which satisfies the condition that it can be expressed as a continuous function 
of the parameters. Thus, the distribution of the sum and product in our 
example may equally well be found for the universe described by the Type III 
curve, Ate~ ai (t > 0). For, then 

* = A 1 ti U e~ at,|+f * 1 _ 4 5 ye““ 


and so, using one-half of the same ? as before, since now x, y ^ 0, 


J{x, y) = A 2 ye ” 


= 0 


V^ 2 — 4 y' 


if 

if 


From this, F{y) can be found by integration (c.f. Kullbaeh, 1934) 




- A'y f 


-ax 


vT, V& - 4y 


dx 


I 2 p co ' 

_ Ajt f e _ 

2 Jj u 


du. 


x' > 4 y, 
x < 4 y. 


As another illustration, consider a normal universe of n intercorrelated vari¬ 
ables in which all the total intercorrelations are equal to r (e.g., the statures of 
n brothers) arid let the sample be a single group of n (one individual for each 
variable). 

l - n b'* '! +tl (»,'“(] 

* = R e ’ 

where R » (1 — r)""‘[l — (ft — l)r], h = (1 - r) ft_! [l — (n — 2)r], and 
h = — r(l — r)" . Suppose one wishes to find the simultaneous distribution 

t 
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of the variance x and the mean y for such samples. 3 Since for Student's problem 
Fisher has found the value of q for this x and y to be 

q = cx 2 AxAy, 

their distribution f(x, y) for this universe may be written down immediately. 
In'terms of x and y the bracket in the exponent of 0 is y 2 (fan - fan + fan 2 ) 
zn(fa — fa), and so f(x, y ) is the product of q and this form of <fi: 

f(%,y) = Ke s z 2 , E = - 2 ^; [(bn - fan ■+■ fan 2 )y 2 - n(fa - fa)x], 

(in) Another attribute of this method is that it sometimes lends itself to easy 
extensions from a simple case where there is only one restriction (N - 1 degrees 
of freedom) to similar cases when there are more restrictions. Thus R. A. 
Fisher (1924) proceeded from the variance of a sample from a single universe 
to the variance from a set of universes, as required in the theory of analysis of 
variance; and thus also (1915) he had proceeded from the distribution of r to 
that of multiple 72; and Hotelling (1927) showed how these distributions could 
be obtained when the values of each variate were themselves intercorrelated 
(as in a time series) and not merely correlated with values of the other variates. 

Theorem III. Now let us consider again the fundamental form (I). For 
convenience let nN = m. If the conditions will not permit us to write the right 
side in the form in (II), it is still possible that we may be able to find that 
(m + l)-dimensional volume by some other method. In particular, whenever 
it is possible to iterate the integral once we have the formula: 

(III) f fdX = ( dT' f 4>dt m , 

Jv JT f Jq „i 

where q m is the section of q by t m space at the point (ti, • * - , t m - 1 ) of, T* space, 
T‘ space being the space of the (h, • * • , fa-i) coordinates. With added condi¬ 
tions one may deduce from (III), for the case where there is but a single para¬ 
meter x , the approximate equation: 

(Ilia) fdx = dx [ dT' ■ 4,(h, 

j T 

in which t m is supposed to have been expressed in terms of the other coordinates 
by solving the equation x = g(h /"■ , Q, It is an approximate equation in 
the same sense as (II) was. Sufficient conditions for this change in the left 
side of (III) have already been mentioned in discussing (II), The propriety 
of making the corresponding change in the right hand side may be left for 
determination when the form of £ is given. It will perhaps be sufficient here 
to point out that our earlier example illustrates both the case where this change 


3 A special case of a more general problem solved first by R. A. Fisher. 



96 


BURTON H. CAMP 


is permissible and where it is not. For, let it be required to find the distribution 
f(y) of the product y = Uh without reference to the sum, fr + la. Formula 
(III) yields 


(7) 


'l/+AV 


r*> y , (i/+Av)/fi 

f(y) dy ^ 2 / dh dt 2 

J0 Jyl l \ 


1 

2r<r 2 



a 


This is valid for every value of y including y = 0. If y ^ 0, we may change 
the right hand side ns in (Ilia) and obtain as the probability that y is in the 
interval ( y } y + Ay ): 


( 8 ) 




(Iti + e, 


where £ is a differential of higher order than Ay. This may be proved by com¬ 
puting the difference between the value of (7) when k has constantly the value 
{y Ay)fti and when it has constantly the value y/k. If y = 0 this change 
in the right side of (7) is not valid; it is easily seen that in this case the integral 
on the right of (8) is infinite. It may be shown, however, in this case that' 


(») 



dy = \ - 


i r e dx 

2?T Jl tc 's/\x 2 — 1 1 


and that this is an infinitesimal, and that it is of order as small as one. 

Many authors think of (Ilia) as the fundamental formula in the theory of 
probability distributions. One of the simplest and earliest applications of it 
was to establish the so-called reproductive property of the normal law: that 
the sum of two variates is distributed normally if each is distributed normally. 
Jackson (1935) has used it to establish a similar property for two Type III 
distributions which have the same exponent of e. Usually this integral is 
difficult to evaluate when N > 2 because of the unsymmetrical form into 
which it is cast, but when N = 2 and there is but one parameter (Ilia) it is 
perhaps the most convenient of all the formulae. 

Theorem IV. An exceedingly useful formula is obtainable from (I) in the 
following manner. Let'£>(mi, ■ ■ ■ , x P \ ai, , a Q ) be a finite single valued 

function of the old parameters ($) and of some new parameters (a). Subject 
to general conditions to be stated we may write: 

(IV) jf 6f dX - [ g't dT, 

an identity with respect to each a, where d 1 is the result of substituting (2) 
for the ai's in 6. 

Since this theorem has not been proved in this general form, an outline of 
the proof will be given. Sufficient conditions are: 

(a) All the integrals involved shall exist. 
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(6) If p is limited (in the sense that it lies within a finite hypersphere), so 
is g, and conversely. 

Proof. Let Xo be a limited p set and Tq the corresponding q set such that 
both (c) and (d) hold (e > 0): 


(c) 

(d) 




< 

< €. 


It is easy to see that such an and a corresponding T a do exist, as/ollows: 
Let Xo be a limited set for which (c) is true, and for which it will remain 
true no matter what points are added to Xj, Similarly, let 2 T 1 be a limited 
set for which ( d) is true and for which it will remain true, no matter what 
points are added to Ti. Presumably Xo and 7 1 ! do not correspond to each 
other, but we may now let Xj be the totality of all the points of Xj and of all 
those points of X corresponding to T 0f and let Ta be the totality of all the 
points of T'o and of all those points of T corresponding to X f a . Then X 0 and 
Ta do correspond to each other and have the desired properties (c) and (d). 
Now, since 6 is finite, it is limited in Xo. Let 

(e) |0|<ffmX o . 

Divide the interval (-H, H) into s equal subintervals of length k, thus defining 
in X 0 according to Lebesgue the measurable sets, 

Pi (i = 1, • ■ • , s), and corresponding <?,■ sets in jT 0 : 

Os 0 £ h in pi, 

(/) 

Os &' ^ k in qi. 

Choose arbitrarily any point of pi and let hi be the corresponding value of d. 
Then let 

6 = ki in pi (i = 1, ■ ■ ■ , s), and similarly let 
6' = hi in g, (i = 1, • ■ • , s). 


Then 
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Now 


Mo 


{d~$)fdx 


g f \ 0 -e\fdX gh f fdX, 

JX o Jx 0 


and 


f (0' - 6') dX gh f <i>dX 

Jt o JTb 


So, as h approaches zero both sides of (g) approach limits and their limits are 
equal: i 

I dfdX = [ 9'4>dT, 

Jx 0 Jt 0 

Hence by (c) and (d) the integrals 

jdf d$f 

differ at most by 2e, and so, being independent of e they do not differ at all. 

In order to determine the form of / from (IV) one must first evaluate the 
right side, 

fyfydl = , a 9 ); 

and then solve the integral equation, 

( 10 ) J*0fdX = t, 


It is the solution of this equation that usually presents the most difficulty, 
Particular forms of 9 that are being used are 

(11) e = e a ^"\+ a p*p ) 

in which case f is said to be the "characteristic function” or "moment generating 
function”; and 


( 12 ) 9 xV, 

in which case ^ is a "moment function” or "moment" of /. Other forms might 
be used, For example, a very convenient method of demonstrating the correct¬ 
ness of the usual formula for the simultaneous distribution of the correlation 
(x), means {y f z), and variances (u f v), in samples from a normal bivariate 
universe is by the use of 

ff — <■“* H- p * + i/ J + « 4 ) + + 

This method of finding / is not a final determination of the probability function 
desired until it has been shown that the solution is unique, a serious problem 
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in itself; it is one of those which Professor Shohat may consider. 11 There are 
three methods of solving the integral equation (10): 

(i) The first might be called guessing. Though unscientific, it is in fact 
often effective. Especially is it available if the distribution has already been 
surmised but not demonstrated. Thus, it was open to Student (1908) when 
he correctly surmised the distribution of the variance. Similarly it was open 
to Soper (1913) when he incorrectly surmised the distribution of r, 

(ii) Papers by Romanovsky (1925) and Wilks (1932) have shown how the 
problem of solving the integral equation may be shifted to the problem of 
solving a partial differential equation, but this in turn may involve the solution 
of another equally difficult integral equation in the process or determining the 
arbitrary function, 

(m) If each a be replaced by an imaginary pi and one uses a Fourier trans¬ 
form, one arrives at a set of formulae which are most important. For the case 
' where there is but one x and one. (3, they may be written; 

(13) [ e^fix) dx = f e ifl <f,dT = +(fi). 

J—nt Jr 

(14) fix) - i e-" m de. 

Dodd (1925) has given an equivalent set of formulae involving only real vari¬ 
ables. It is easy to prove that both sets may be changed to the single formula, 

(16) fix) - 1 f 

7T JT 

Kullbach (1936) has established the validity of the formulae corresponding to 
(13) and (14) for the general case of (P + Q) parameters. Wishart and Bartlett 
(1933) used the general forms to find the distribution of the generalized product 
moment in samples from an ri-dimensional normal system. 

When the solution of the integral equations of (IV) cannot be found, one 
has to put up with the semi-invariants or with the moments of /, Formulae 
(IV) and (11) yield the semi-invariants, (IV) and (12) the moments about the 
given origin, and from either of these one may obtain the moments about the 
mean point. These methods are old but they are still important. Time does 
not permit me to discuss them, because it would not be proper to close this 
paper without some reference to limit methods. 

Limit Methods. It is well known that the distribution of means of samples 
taken from almost 6 any universe approaches the normal law as a limit as N 
becomes infinite. This theorem is subject to great generalizations,' as is indi¬ 
cated in papers of A. Liapounoff (1901), S. Bernstein (1926), Romanovsky 


* In & later paper at the same symposium, 

6 There are exceptions. E. g means of samples taken from the universe a/ir(a + t 1 ) 
have a distribution identical with the universe itself. 


4> dt jf cos fi(x - g) d/3. 
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(1929, 1930) and C. C. Craig (1932). Subject to very general conditions it 
has been shown that: If the characteristic function of one probability distri- 
bution contains a parameter and approaches as ft limit, uniformly in every 
finite domain of its variables, the characteristic function of another probability 
distribution; then the first distribution approaches as a limit the second distri¬ 
bution. Hence S. Bernstein and Romanovsky have shown that: If the universe 
is an n-way correlation solid of a certain very general type, then the n means 

obtained by a selection of a sample of N sets of variates, Xi = (4,-, + • • • -f 

(i = 1, ■ ” , n), have a distribution which approaches as a limit a normal 
correlation solid as N becomes infinite, A similar theorem has been established 
also in the interesting case of Romanovsky's "belonging coefficients”, which 
include K. PearBon’s coefficient of racial likeness. Also, by the method of 
maximum likelihood, Hotelling (1930) has proved that under certain general 
conditions all optimum estimates of the parameters of a frequency distribution 
have a joint distribution approaching the normal as N becomes infinite. The 
validity of the method of maximum likelihood when used for this purpose has 
been established by J. L. Doob (1934), 

Finally, one may note an apparently new limit theorem of another type. 
Its general nature will bo obvious from the following application: 

Let a sample of N be drawn from the universe, 

<t> = Ae~ a '*\ if t > 0, 

; =0 if L ^ 0. 


It is readily proved, by means of (IV), that the distribution f{x) of the para¬ 
meter, 

£ = (ll + ■ ■ • -Hff ) 

is a curve of the form, 

f(x ) = Bx N ~ l where x > 0, 


= 0 elsewhere. 


Now let X become infinite. The universe approaches as a limit the rectangle: 

= A where 0 g i < 1, 

= 0 elsewhere, 

The parameter % approaches as a limit X, where X = maximum ti. The 
distribution /($) approaches as a limit the new distribution, 

F(X) - NX N ~ l where 0 < | X \ < 1, 

= 0 


elsewhere, 
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Hence we have proved in a new way, what was already known: that the distri¬ 
bution of the greatest variate obtained by sampling from a rectangular universe 
is of the form F(X). 

The limit theorem implicit in this illustration can be established in sufficient 
generality, but I do not yet know whether it has other applications of value. 
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MOMENT RECURRENCE RELATIONS FOR BINOMIAL, POISSON 
AND HYPERGEOMETRIC FREQUENCY DISTRIBUTIONS 1 

By John Rioudan 

I. Introduction. This paper gives the development of recurrence relations 
for moments about the origin and mean of binomial, Poisson, and hyper¬ 
geometric frequency distributions from the basis of the moment arrays defined 
by H. E. Soper . 2 This procedure has the advantage of expressing the moments 
in terms of coefficients which are alike for the three distributions and are de¬ 
rivable by a single process, thus providing a degree of formal coordination of 
the distributions. For both kinds of moments, the coefficients satisfy relatively 
simple recurrence relations, the use of which leads to recurrence relations for 
the moments, thus unifying the derivation of these relations for the three 
distributions, ■ The relations derived in this way for the hypergcometric dis¬ 
tribution are apparently new. Apparently new recurrence relations for certain 
auxiliary coefficients in the expression of the moments about the moan of 
binomial and Poisson distributions are also given. 

This course of development involves repetition of a number of well-known 
results which is justified, it is hoped, by the unification obtained . 3 


1 Presented to the American Mathematical Society, Sept. 3, 1936. 

* Frequency Arrays, Cambridge, 1922, 
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320-321. 
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2. Moment Arrays. As developed by Soper, frequency distributions may be 
exhibited by frequency arrays, in the case of a single variate, in the form: 

(2.1) /(A) = £ p* A* 

z 


where p x are the frequencies with which the measures, x, of the character, A, 
occur in a population. 

The substitution A - e a leads to the moment about the origin array: 


( 2 . 2 ) 


/(«“) = E?.®” 

=e4+“+J+-) ■ 


A 



where 



The symbol a is a logical or umbral symbol serving merely to identify the 
moments in the expansion of the array. 

The moment array for moments about the mean is found from the relation: 

t(e°) = e~ m ’/(e“) 

= IJ Mi a‘/n I 

A 

where mi is the first moment about the origin. 

The moment arrays for the distributions concerned are as follows: 

Binomial /(«") „ [1 + p(e“ - ])]” = £ ( n ) p*(e" - l) 1 
Poisson /(«*) = ~ 

i™[l X I 


Hypergeometric f(e a ) = 2 ^—Ji , 

2-0 (ft)* x 1 

\ 

where the parameters p, n, and a for the binomial and Poisson have the usual 
significance. The parameters for the hypergeometric distribution, with the 
substitution r ~ s, follow Soper; Pearson (loo, cit.) uses q, r, n, where q — l/n. 
The notation (i) r means 


(l) t — i(Z — 1) • * • (i — x -|- 1), 


It will be seen that, with the usual interpretation of 



as zero for x > n, 
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the three distributions so far as concerns a may be exhibited by a function 
of the form 

/(O = - D' 

whore ili of course depends on the distribution concerned. 


3, Moments About the Origin. The moments about the origin can then bo 
defined by the equation: 

(3.1) Em,“' = EAj.e° - 1)* 

8"=0 v? 1 £ 

and 

E A\f - 1)*= E^,£(- l)* - ' (*) e** 

1=0 1=0 v-0 \ V / 

*=o fi 1 ! *«■() 

where S x , „ is a Stirling number of the second kind, as used by Jordan (loc. cit.) 
and defined by 

*!&..- E(- ir» (*)„• = A*0\ 

11=0 W 

A*0* being in the language of the finite difference calculus, a “difference of 
nothing" that is A x n | n = O'. 

The internal series terminates at s because = 0, x > s, as is readily 
apparent in the finite difference expression. Further So lt ~ 0, 5 5^ 0; jSq.o “ 1. 

By equating coefficients in equation (3.1), m a , the sth moment about the 
origin, is given by 

a 

(3.2) m t = 

z=0 


The particular forms for the three distributions are as follows: 


(3.3) 

17la — V a 

Binomial , 

(3.4) 

s 

m 9 = 2 a Sx, * 

Poisson 




(3.6) 

V (0x(7')s= (T 

Wia — / j , . Uxt a 

x-u \n)x 

Hypergeometric 


The Stirling numbers have the following recurrence relation (Jordan loc. 
cit.): 

(3.6) 


s+L — a “4“ l t a ■ 
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This relation in conjunction with equations (3.3)-(3;5) leads to moment recur¬ 
rence relations. The procedure is illustrated for the binomial distribution as 
follows: 

«+i 

Wafl = £ (ft)® P $*, *+l 

x—0 

fl+l 

= £ (ft)® P T {% 9 -f* $*-!• «) 

X““0 

= p Dp m, -j- (npm t — p 2 D P m 9 ) 

- npm, + P? D P m, 

where q =■ 1 - p. 

The steps in the process arc expanded as follows; 

a+1 s 

£ (ft)®p T xS?,9 = £ (ft)a P* x & x , » 
x ™0 i =0 

* £ (ft)® S*, * pD P (p x ,) 

2=0 

=: pD„m, 

i-H 0+1 

2 (fOzP 5r-l h j “ ^ (T!f X + 1) 0t)j—Ip Sz-1, « 

i *0 i =0 

- ft £ (ft)® p a+l - £ z(ft)® p i+I S*, * 

- npm, - p^D^m, 

The results for the three distributions are as follows: 


(3.7) 

771 , 4-1 * npm, + pqDpin, 

Binomial 

(3.8) 

fti-H-i = am, + oD«m. 

Poisson 

(3.9) 

IrV 

m,+i - - m,(l - 1, r — 1, ft - 1) - (ft + l)A„wi, 

ft 

Hyper geometric 


Here D p and D* denote differentiation with respect to p and a, respectively, 
and A„ denotes the difference operation with respect to ft. For the hyper¬ 
geometric distribution the moments are functions of l, r, and ft as well as of s', 
m,(l - 1 , t - 1 , n - 1 ) is the same function of l — 1 , r - 1 and ft — 1 as 
m,(l, r, n) is of l, r, n. Equation (3.9) appears to be new, 
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For convenience of reference, a short table of the Stirling numbers of the 
second kind follows: 


0 

1 

2 

3 

4 

5 


0 

1 

0 

0 

0 

0 

0 


1 

1 

1 

1 

1 


1 

3 

7 

15 


1 

6 

25 


1 

10 


1 


4. Moments About the Mean. As shown in Section 2 above, moments 
about the mean may be defined as follows: 


(4.1) 


i^O 5 I 


z=0 


where mi is the first moment about the origin: 

mi = np Binomial 
= a Poisson 
= lr/n Bypergeomelric 

Now 


EU.e-" 1 ” (e"-ir=E^.E(-l) 

1=0 u-^D 


” (:) 


(u-mi)a 


= X ^ X x 1 A g j fl , 

j«=0 S J z“0 


where 


$ 1 o x , i = X(-' 1) 

«=o 


Tl 


(:) <»- m >y = 


A* (-* mi)*. 


It will be observed that for mi ~ 0, - The internal series terminates' 

at s for the same reason as before. 

The moments about the mean are then given by: 


(4.2) 




= X X 1 Ax £T*, < 


z .“0 


The particular forms for the three distributions are as follows: 


(4.3) 

P-, = X (ft)* P* * 

Binomial 

(4.4) 

a 

M* - X a * tf *. * 

I“[) 

Poisson 

(4.5) 

V^(0a(r)x 

(It). 

Hypergeometric. 
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The coefficients tr 4 , 4 satisfy the following recurrence relation:* 

(4.6) (J'x.b+I ™ (# a 4" flx—l,s 

which in conjunction with equations (4,3)-(4.5) leads to moment recurrence 
relations as before. The actual derivation is somewhat complicated by the 
circumstance that a Xlt is a function of mi and therefore of the frequency param¬ 
eters, rather than a constant as before. The derivation is illustrated for the 
binomial distribution as follows: 


<+i 


Jt^+i 


— X/ (v^x V Gx, M-l 


zr*=0 

H-i 


= 2 (n) x v % K# “ np)**, ■ + ^-i, t) 


1^0 

A 


S+1 


= 2 (n)x v*, a yDpil?) - ni¥* + 2 ( ra )* V* <r*-i, * 




x^Q 


= pD v fi„ 4- nspn«.,i - npv, -f 7ipn, - p\D p ju a + nspi^i] 
- VQ 


The steps in the process are expanded as follows: 

& a 

2 {n) x ff x , a pD P (p x ) = 2 (n) x [pD P (p x <Tt,s) ~ p *,,)] 

a 

= pD p p s - p 2 (n)*y*(~ nsa*, a -i) 

*■=0 

^ + TtSPHs —1 

j-H *+l 

2 = 2 (?1 - ^ + 1) (f0*-i2>*tr*-i.* 

- ft 2 W« P I+1 tfx ( » - 2 p I+1 ffM 

x^Q x=D 

= ftp/i, - j/[I>pp s + ns M a -i]. 

The relation D v c Xig = — ns^ s , s _i is obtained from the definition equation of 
<r tl * (with mi - np ). 


The resulting recurrence relations for the three distributions are as follows: 

(4.7) jj 8 +i — rasp j p 8 _i pq D p )i, Binomial 

(4.8) p i+ \ = asjji s _i -J - a Dap, Poisson 


4 Jordan, loo. cit. or E. C. Molina, An Expansion for Laplacian Integrals . .., Belt 
Syatem Technical Journal, 11, p. 671, 
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(4.9) 


where 


/J»+i = (» + 1) 



[ Mb 5 0 K * ^ r ’ 71 + *)] 

2 Q — 1, r — 1, n — 1)J 


Hypergeometric 


Ki 

K, 


— lr _ lr 
n{n ■+ 1) “ n n 

(l - 1) 6r - 1) 

(» - 1) 


lr 

n 


The last of these, which appears to be new, seems to be of formal interest only. 
The coefficients tr*,. are related to the Stirling numbers by the expression: 




( 1) 


v~D 



J— z 


and consequently can be exhibited with detached coefficients in the form 
Go + + «2 + ■ * * + Gi-s ■ For the binomial and Poisson distributions 

certain simplifications, to be developed in the section following, in equations 
(4.3) and (4.4) may be made. For the hypergeometric distribution it appears 
necessary to use equation (4.5); the following short table of tr x , t) employing the 


detached coefficients mentioned above, is given for this purpose: 

\ a . T** i 

i\ 

0 

1 

2 

8 t ft 

1 

0-1 

1 



2 

0+0+1 

1-2 

1 


3 

0+0+0-1 

1-3+3 

3-3 

1 

4 

0+0+0+0+1 

1—4+0—4 

7-12+6 

0-4 1 

5 

o+o+o+o+o-i 

1-5+10-10+5 

15-36+30-10 

25-30+10 10-5 1 


5. Binomial and Poisson Moments About the Mean—Simplified Formulas. 
5.1 Binomial. From examination of the first few moments about the mean, 
it appears expedient 6 to write the formulas: 

B 

Mu = T,<Xx.*Anpq¥ 

(5.1.D ‘- 1 B 

= (q-v)T, a *' fc+i (npq) x 

i— 1 


i The kind of expression chosen admits of some variety. A recurrence relation for 

B 

coefficients in the expansion /i, — ^ <*x.iV* has been given by E. H. Larguier, On a Method 

For Evaluating the Moments of a Bernoulli Distribution, Dull, Am, Math, Soo., 42, 1, p. 24 

(Abstract 8): I am indebted to Mr. Larguier for the opportunity of examining his results 
1 1 

in advance of publication. 
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When these are substituted into the moment recurrence relation, the coefficients 
are found to be related as follows: 

OL Xl 2« = [X + P^pq\ a x,U-l + (2S — l)tfx~l,2g-2 
i 

-2pg[l 4* 2s d- 2pgD P g]otx f 2 S ~j 


a*, 2 »+i = [as + pqD ptI )a Sl i, 4- 2sa,-ua-a 


or, in general, 


(5.1.2) 


Ofx.a-5-l " [X -f pgDpglota,, + SCI£x_l (t _l 

“ pff[l “ (-1)1 [1 + + 2pqD P g]a XlS 


Using detached coefficients of powers of pq as outlined above, these coeffi¬ 
cients may be exhibited as follows: 



2 

3 

4 

5 
0 

7 

8 
9 


1 


2 


3 


4 


1 

1 

1-6 . 3 

1-12 10 
1 - 30 -f 120 25 - 130 

1 - 60 4 360 56 - 462 

1 - 120 + 1680 - 5040 119 - 2156 4 7308 

1 - 252 + 5040 - 20160 246 - 6948 4 32112 


15 

105 

490 - 2380 105 

1918 - 13216 1260 


It may be noted that the coefficients of the first column in conjunction with 
equations (5.1.1) give the binomial seminvariants. 

Equations (6.1.1) make the coefficients functions of pq only; a slight alter- 
ation makes the coefficients functions of n only, Thus: 


(5,1.3) 


w* — 2 {vq) 


t-i 


/i 2 .+i -(q- p) D&. 2.+1 w 


and the coefficients are found to satisfy the recurrence relation: 

(5.1.4) ~ x/? X| t 4 n&ftx— i,i-i — [1 ( 1) ](2x — l)^r^i,*. 

These coefficients may be exhibited by a rearrangement of the table given 
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above as may be seen by comparing equations (5.1.1) and (5.1.3). The first 
few coefficients arc as follows: 

V 

n' 1 ?«,. 

A 

1 2 3 

2 

1 

i 

3 

1 

4 

1 -6 + 3 

5 

1 -12 + 10 

fi 

1 - 30 + 25 120 - 130 + 15 

1 

5.2 Poisson. 

The Poisson moments about the mean may be expressed as 

follows: 

i 

|t/2] 

(5,2.1) 

« 

IJ 


where (] represents “integral part of and 


(6.2.2) = $Ux,a d* 

The coefficients a Xit are the constant terms in the expressions for the corre¬ 
sponding binomial distribution coefficients in powers of pf, 


Bell Telephone Laboiutoiubs. 



NOTE ON ZOCH’S PAPER ON THE POSTULATE OF THE 

ARITHMETIC MEAN 

By Albert Wertheimer 

1. Introduction. There appeared recently a paper by Richmond T. Zoch 1 
entitled "On The Postulate of the Arithmetic Mean.” The stated purpose of 
his paper, was to show that the derivation of the Postulate as given by Whit¬ 
taker & Robinson, is not correct. It is the purpose of this paper to show, 
that Zoch has not proven any error to exist in the Whittaker & Robinson deri¬ 
vation, but that there are a few errors in his paper. As this paper 13 intended 
to be read with Zoch'a paper as a reference, the terms used there will not be 
redefined'here, and except where otherwise stated, the symbols used will have 
the same meaning. 

2. Zoch introduces the function 

and claims that it satisfies all the four axioms of Whittaker & Robinson, and 
obviously it is not the arithmetic mean. He therefore concludes that their 
derivation must have errors somewhere, and proceeds to find them. Let us 
first examine the / function. Considering only the part /ia/n 2 , the partial 
derivatives with respect to &,■ are given by 

3ju2{(£i — #) 2 — ffcj ~ $) 

2 

n,us 

It is then stated (p. 172) tr ... clearly these partial derivatives are single valued 
and continuous. Therefore the function fi 3 /(i 2 satisfies axiom IV.” Now, 
the condition that a function be continuous and single valued means of course 
that this be true throughout the region of definition of the function. It is not 
shown how these derivatives are clearly continuous and single valued for the 
very important case where all the e’s are equal and the derivatives become 
indeterminate. As a matter of fact they are not continuous in this case, and 
therefore the / function does not satisfy axiom IV. To prove this, we only 
have to consider the very simple case where we let 

Si = Jc + CiZ 


‘This Journal Vol. VI no. 4, Doc. 1935, pp. 17M82. 
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where k is a fixed constant, C( is a set of-arbitrary constants not all equal, and 
z is a parameter. We then have 

x = k + cz 

f 2 

, 

l 8 

Ma = M3Z 

where 

c - 1/n 2 Ci 
4 = 1/n £ (cv - 5) 2 
. 4 = 1/n 2 ( c f — c) a 

Substituting these values in / and the derivatives, we get taking a — 1, 

/ = k + zb -f* £4/*4 

df/dx . = \j n + 3^w[»*(e ( - 8) 8 - 2z‘to(c t - a) 

n?Vr 

Now going to the limit when t approaches zero, and all the &'s approach k, 
we get 

limit / ^ k, 

r ^»0 

limit OJ/dXi = l/n| -2 + 3(c, - a) 2 *; - 2 ^(«, - «)/*') 

z-+0 


Thus, when all the z’s approach the same value, the function / also approaches 
the same value independent of the c’s, that is regardless of the mode of approach, 
while the derivatives can take on any value depending on the c's that is on 
how the limiting value of / is approached. The / function then does not have 
continuous single valued partial derivatives, and therefore does not satisfy 
axiom IV. 

In part 2 of the paper it is stated "Now when the Zf all approach a then both 
/ and df/dXj become indeterminate forms. However, in this ease / takes an 
indeterminate form which can be evaluated and it can be shown that fi^/m 
will always have the value zero, i.e,,/ will have the value a when all the z f —> a ; 
while the df/dXi can take any value whatever and in general the df/dx* will 
not be equal when the Xi -+ a” This statement really amounts to saying that 
the / function does not satisfy axiom IV, but it is there used to demonstrate 
that one of Schiaparelli's propositions is false. 

\ 

3. Having exhibited a function different from the arithmetic mean, and sup¬ 
posedly satisfying all the four axioms, the question is asked "Where is the proof 
given by Whittaker & Robinson lacking in rigor?" After numbering the 
various steps in the derivation "... for the sake of rigor and careful reasoning 
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..it is stated (p. 174), “The sixth* step involves the tacit assumption that 
the partial derivatives are functions of k. These partial derivatives are not 
necessarily functions of k ..and it is therefore concluded that the sixth 
step is not valid. Now, how can any function that by definition is to be evalu¬ 
ated at 6kxi not be a function of 7c? What is shown (pp. 174-5) is that 
these derivatives do not necessarily involve k explicitly, but this is neither 
implied nor necessary for the sixth step, and there is no ground for doubting 
its validity. 

4. In order to overcome the supposed defect in the sixth step, it is proposed 

to change axiom IV so as to require the partial derivatives to he constants. 
But even then (p. 175) . . there remains an objection in the seventh step," 

Now, the seventh step consists of the statement that if 

4>(xi) = £ W 

where the c’s are independent of the re's then due to the condition that $ be a 
symmetric function, all the c's must be equal. To show the defect in this 
step it is stated, that under certain conditions ,f ... the function / ^ x + ju 3 //i 2 
will have partial derivatives with respect to a;,' which are unequal and constant; 
yet at the same time the function / is a symmetrical expression of the n vari¬ 
ables.” Granting that all that is correct, what has this got to do with the 
seventh step? The / function certainly is not of the type 2 c { X{ to which 
the seventh step is applied. 

5. One more point should be mentioned. On p. 181 it is supposedly proven 
that any function satisfying the first three axioms must have continuous first 
partial derivatives. The proof is essentially as follows: Assuming all the s’s 
arc given the same increment Ax, the increment of the function then is A0, 
It is then stated "... but by axiom I, A<£ = Ax. Therefore A<f>/Ax = 1 = d^/dx. 
In other words, the total derivative of exists and is constant, Therefore the 
total derivative of <f> is continuous." From this, the continuity of the first 
partial derivatives is proven by means of Euler’s Theorem for homogeneous 
functions. Now, just what does the symbol dtftjdx (which is called the total 
derivative) mean for a function of many independent variables? Besides, 
(whatever this symbol means) is.it considered rigorous to deduce a general 
Theorem from the very special case where all the differentials are made equal? 
This is one place where the / function could be used effectively as an exhibit 
of a function satisfying the first three axioms, and not having continuous partial 
derivatives. 

It is also stated (p. 181) that "... it would seem more satisfactory to postu¬ 
late that the function $ is single valued, for the single-valuedness of a derivative 
docs not insure the single-valuedness of the integral while the single-valuedness 
of a function does insure the single-valuedness of the derivative where the 
derivative exists.” This statement is certainly not self evident and requires 
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proof. For a single variable at least, it is easy to imagine a function repre¬ 
sented by a curve with comers defined in a certain interval. The function then 
could be single valued everywhere in the interval, while the derivatives at the 
corners may exist and have two distinct values, depending on whether the 
corner is approached from the right or the left. On the other hand it is hard 
to imagine a curve representing a single valued function such that the integral 
i.e. the function represented by the area under the curve should not be single 
valued. 

P 

6. In Conclusion: It is stated in the Introduction that “Since this book has 
had wide circulation, it is believed that the errors in this proof should be called 
to the attention of the users of the book. The present paper has been prepared 
for this purpose." It is for the same reason, that this paper was prepared to 
show that no error has been proven to exist, 


Btjueau op Ordnance, U. S. Navy Department 



NOTE ON THE BINOMIAL DISTRIBUTION 

By C. E. Clark 


The purpose of this note is to show that 



,,_a f Q n n\/p\ x sin ‘irac 

/(l) = (_ i ) — y ^ 


where n is an integer £ 0,0 < p < 1, p + q - 1, and aJ"' 5 ' 15 = x(x — 1) (x -2) 
- • • (x - n), is a function whose values at x - 0,1, 2, ■ • ■ n are the successive 
terras of the expansion of (g + p) n , and also to consider the problem of fitting 
j(x) to an observed frequency distribution. 

The statement made about (1) can be verified by evaluating (1) as an inde¬ 
terminate form. On the other handj (1) can be derived by observing that the 
x-th term (i an integer) of the expansion of (g + v)* is 


( 2 ) 


n I * n-x _ T(n + 1 )pY * 

x\(n-x)\ Vq r(* + I)r(n-*+!)’ 


then (1) can be derived from (2) by means of the product expansions for T(x) 
and sin x, This derivation of (1) from (2) can also be carried out by expressing 
(2) as a Beta function and then using 


B(x + 1, n - x + 1) 


r r 

Jo (1 + 0"+ 2 


dt *s 


(-i r 


C»+l) 

7T X _ 

(71 +1)! sin irx* 


This integration can be performed by means of the theory of residues. 

Consider the problem of fitting (1) to an observed frequency distribution. 
We shall write (1) in the form 


(3) .* = + *(* - *) 

and determine the constants a, b, n, and h so that, when 2 is the mean of the 

observed distribution, F(z) will fit the distribution, 

The values of a, i>, n , and h can be determined by the method of moments. 

Let Vi , , and n , denote the usual second, third, and fourth moments of the 

distribution, which are calculated in the usual way (as in W, P. Elderton, 

Frequency-Curves and Correlation) and not adjusted by.any procedure such as 

2 

Sheppard's adjustments. Also, use the usual notation ]9i = ^ and ft = 

t»2 Vi 
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Then, the method of moments gives 


(4) 

( 6 ) 


= 2 
3 + 0i — 0s 

2 -f n(3i ± Vtt0i(4 + nfi\) 
2 


‘ - f?(nb) 


a = (-!)’ 


A(S/)nl 


•, where 2/ is the sum of the frequencies of the distribution. 


7r(l 4* &)" J 

An integer n is chosen, nearest the value assigned by (4). The two values of 
b from (6) determine two curves that are congruent but whose skewnesses are 
of opposite sign. Hence, b is uniquely determined by (5) and the sign of the 
skewness of the data. 

For a symmetrical distribution, 6 = 1, vj — 0, and 

2 


n — 


h = 


3-fr 
Vn 
2 a/ P2 


We shall consider an illustrative example. In the following table the columns 
f(z) and fi{z) are taken from W, P. Elderton, Frequency-Curves and Correlation 
(1906), page 62. f(z) is an empirical frequency distribution, while fs(z) is 
obtained by fitting a Pearson Type II curve to the distribution f(z). fi(g) is 
computed from 


/,(*) = 1624 2^, as = 2.0973 + .80& 


which is determined by the method of this note. f 3 (z) is obtained by fitting 
the normal curve 

(f-,1085) 1 
2 ( 1 . 820 ) 


Uiz) = 486.1c 


% 

f(z) 

M*) 

/l(z) 

m 

—r3 

11 

18 

14 

19 

-2 

116 

107 

109 

92 

-1 

274 

281 

286 

263 

0 

461 

438 

433 

444 

1 

432 

437 

433 

444 

2 

267 

267 

285 

263 

3 

116 

106 

109 

92 

4 

16 

18 

14 

19 


The coefficients of goodness of fit for f 1 (z) ) / 2 (z), and f 3 (z) 'are respectively 
.35, .58, and .02. 



CONVEXITY PROPERTIES OF GENERALIZED MEAN VALUE 

FUNCTIONS 1 

By Nilan Norris 


Consider the following generalized mean value functions: (1) the unit weight 


or simple sample form, 0(1) = 


__ ( ^ "1~ ^2 "l" • • * H~ ftfAi • 


n 


, in which the a,- are posi¬ 


tive real numbers not all equal each to each, and in which i may take any real 
value; (2) the weighted sample form, u>(l) ~ C2Xi 

\ C\ + Ca + “ ■ T c " / 

in which the c< are positive numbers not all equal each to each, and in which the 


%i and l are restricted as in 0(i); (3) the integral form, 8(t) — 


xdx 


where I a:Via: exists for every real value of i] and (4) the generalized integral 


I 


form <h(f) = J xVtyft)^', where ^(x) is a non-decreasing function integrable 
in the Riemann-Stieltjes sense such that 0(«>) - 0(0) = 1, and such that 
x l d^(x) exists for every real value of t. The facts that all of these func- 


/■ 


tions are monotonic increasing and that both 0(1) and wft) have two horizontal 
asymptotes have been previously demonstrated, 2 Although the existence of 
0(f) and «(l) lias been known since 1840, there appears to have been no attempt 
made to investigate the behavior of the second derivatives of them. 3 

When the x,- are price relatives, production relatives, or similar data, 0(1) 
andw(J) yield common types of index numbers by direct substitution of integral 
values of t. For any values of t such that 0 < <j < U < « } the type bias of 
0 ft) will be greater than the type bias of 0ft). Similarly, for any values of t 
such that — « < k < k < 0, the type bias of 0ft) will be greater than the 
type bias of 0ft), The second derivatives of 0(1) and w(J) indicate whether 


1 Presented at a joint meeting of the American Mathematical Society, the Econometric 
Society, and the Institute of Mathematical Statistics at St, Louis on January 2, 193G. 
The writer is indebted to C, 0. Craig, Einnr Hille, Dunham Jackson, and J. Shohat for 
helpful critical reviews of the preliminary draft of this paper. 

2 G. H. Hardy, J. E. Littlewood, and G. Pdlya, Inequalities (Cambridge University 
Press, London, 1034), pp. 12-16; tuid Nilan Norris/‘Inequalities among Averages,' 1 Annals 
of Mathematical Statistics, Vol. VI, No. 1, March, 1936, pp. 27-29. 

* Jules Bihnaym^, Socttlb Philamatiqucde Pans, Extraits des proc^s-verbaux des stances 
pendant I'annde 1840 (Imprimerie D’A. Ren6 et Cie., Paris, 1841), SGance du 13 juin 1840 

p. 68. 
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type bias is changing at an increasing or a decreasing rate as between the un¬ 
limited number of averages available for use. Considerable interest attaches 
to u(0> the weighted sample form of function. 

Let u(f) be made arbitrary for the case of n - 2, with s, = 1, and x 2 ■= e“\ 
where X is any real number. Also let c x - a, and - 0, where a -J- 0 = 1. 

THen o>(0 = [a (3<T Xi J‘. Now for all values of (, . 


For 1 1 1 sufficiently small, it follows that 
log (« + (&*) = - m + i /3X 2 (1 - /3)i a + | + | - 


so that for / ^ 0 



j log(« + fr*) = -jSX + *j9X a (l - + + | • 

Therefore o>(i) = exp, ^ log (a + |3e“ x ') 

= + *0X 2 (1 - 0)t + /3X a + + -jd) , x|t 2 + *■]. 

1 

S 

that a)(0) is the weighted geometric mean, and that 4>(0) is the unit weight or 
simple sample form of geometric mean. As a means of demonstrating the range 
of values which u"(0) may take it is helpful to rewrite the expression for Q f, (0) 
as follows: 


- P) 2 x . 


It is clear 


It follows that «"(0) = 2j3XV 


1 

0 


*+£-£+ 
0 T 2 3 


«"( 0 ) = 


= ij9 s u-(5)V x-} ^ 


This consideration makes it possible to distinguish three cases of y = /(X, 0) 
for fixed 0, namely, 0</3<^;/3 = f; and £ < 0 < 1. In all three cases 
/(X, 0) has an absolute minimum ^(j9) ^ 0, and /*(£) = 0. The corresponding 

values of X satisfies the quadratic equation X 2 - ^ 4^—-i X + 0. 

\ 3 pU - P) P U - P) 

It is clear that by taking 0 near enough to 0, one can make n(0) as large negative 
'»as is desired. Also, by choosing X properly, one can make gj"( 0) take any 
value between fi(0) and <*>, For example, when a = 0 — X may be selected 
so as to make w"(0) any arbitrarily chosen non-negative number. For then 
X 4 

a>"(0) = —-e \ and as X increases from — « to 0, w"(0) decreases from <« to 
64: 

0. If X = 0, w"(0) = 0. If X > 0, as X increases from 0 to 8, o/'(0) increases to 
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64e~ 4 , and as X increases beyond 8, w"(0) decreases, approaching 0 as X increases 
indefinitely, It is evident that the case of a « jS = with X = “log 2, ®i = 1, 
and = e~ x , is one in which w(t) becomes the unit weight or simple sample 


type of generalized mean value function, namely, <^(i) 



Reference 


to the first expression above n< 
fogft -y/2 in this special case, 


for <c/'(G) will make clear that 0"(D) = 


Analysis of $(t), the generalized integral form of generalized mean value 
function, makes it possible to characterize populations of a very general char¬ 
acter, as well as samples, But in the case of <£(<) it is even more difficult to 
generalize as to convexity properties. For example, let 


m 



“i 


where 




<f 1,1 dv, 


This expression is obviously of the required generalized integral type. Now 



-Ajf .-M)**.,?. 

\MT .J ■—* 


* e 4 

Therefore 3>(t) = e 4 , and $"(£) = — > 0 for all L That is, in this particular 

case, $(t) has only one horizontal asymptote. 

The foregoing examples indicate that the following conclusions may be drawn 
as to the diverse convexity attributes of the various means as functions of i; 
(1) The unit weight form, 0(£), and the weighted sample form, w(£), must always 
have a point of inflection, since both of them not only increase with t } but are 
doubly asymptotic (have two horizontal asymptotes). (2) Points of inflection 
for 0(t) and w(f) do not necessarily occur at t = 0. (3) The generalized integral 
form, $(0, need not always have a point of inflection. That is, the second 
derivatives of certain forms of $(t) do not change their sign, since such forms 
are concave upward, 


University of Michigan. 



A SIMPLE FORM OF PERIODOGRAM 


By Linsmore Alter 


Schuster's introduction of a method of systematic search for hidden periodici¬ 
ties and cycles opened a new field for the investigator of statistical data. The 
beauty of his method in its analogy to analysis of light, and the great reputa¬ 
tion of its author, combined to give it universal acceptance and to blind statis¬ 
ticians to its faults. , 

In more recent years at least three new mathematical and two mechanical 
forms of periodogram analysis have been proposed, each of which exhibits 
certain advantages over the original one. The use of the term periodogram 
for these forms is an extension of Schuster's original definition which used as 
abscissae quantities proportional to the squares of the amplitudes of the sine 
terms found in the data for the various trial periods, He wrote: “It is con¬ 
venient to have a word for some representation of a variable quantity which 
shall correspond to the spectrum of a luminous radiation. I propose the word 
periodogram and define it more particularly in the following way: 


‘ii+r rti+T 

Let hTa « I f(t) cos kidt and iTb - / f(l) sin Udt 
hi Ji i 


where T may for convenience he chosen equal to some integer multiple of 


2 T 

T' 


2ir . . 

and plot a curve with as abscissae and r = V'a 2 + as ordinates; this curve, 

fc 


or better, the space between this curve and the axis of abscissae, represents the 
periodogram of /(<).” 

The following appear to be the essential criteria for a satisfactory form of 
periodogram: 

1. It must exhibit plainly any repetition of form in the data regardless of 
how irregular the shape of the repeated interval may be. In doing this it 
must exaggerate the amplitude of the main terms at the expense of the 
lesser ones. 

•2. The calculation of the indices must be short. In a periodogram from 
many data the indices sometimes are computed for several hundred trial 
periods. 

3. There should be a geometrical interpretation of the index used,, 

4. The frequency distribution of the index must be known. 

6. Combining or smoothing the data should modify the index in a manner 
which leaves an obvious interpretation. 
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The Schuster periodogram has the following disadvantages: 

1. Only sine terms of large amplitude arc exhibited. A perfect repetition 
of an extremely irregular form of data would not be indicated in any way. 

2. The calculations are long. 

3. There is a considerable uncertainty in the length of the period found. 
Those methods of analysis which use harmonics as well as the fundamental 
have much less of this uncertainty. 

The correlation periodogram has advantages in each of these points over the 
Schuster. However, even with it the calculations are fairly long. Further¬ 
more, the modification of the coefficient introduced by grouping or smoothing 
is not a linear one. 

The periodogram described here is a slight modification of one for which a 
preliminary note was published in 1933.. Additional features have been studied 
and its applications to many data have shown its ease of calculation. This 
calculation has been reduced still more by a mechanical method which renders 
it practicable to contemplate the possibility of studying many data hitherto 
prohibited by excessive cost. 

Consider data a 0 , %\, xt , • • * Xi, ■ • • Let l be any integer less than n. 

Form the sum of the absolute values of Xi — designated by 2 | £i — 


Define A = E —- 

i-i n - l 


l takes the values of the various trial periods and 


is called the lag , A, therefore, is the mean error between prediction that data 
will be repeated after a lag of l and the fulfillment of the prediction. Such 
an index has a meaning that is immediately of use to a meteorologist or other 
investigator, Coefficients such as the Schuster and the correlation coefficient, 
although valuable statistically, are of less immediate interest. 

The standard deviation of these errors of prediction follows at once from 
standard formulae under assumption of normal distribution. 


<7 = 1.25 A 


The distribution of <r, as computed from the absolute values of data, has 
been studied by Helmert and by Fisher. Davies and E. S. Pearson have com¬ 
pared the various methods of estimating tr. For the large number, (n — l), 
pairs of data used for a periodogram point, this method becomes almost as 
precise as the usual one which would square-the values of (%i — Xi-i). For 
(ft — 2) as small as 50, the standard deviation of the standard deviation by this 
method is only seven percent larger than by the other one. Fisher has shown 
that 

Vn —1 

This may be written as 

l.Q68(r 

\/2(ft - l) 


/ 


7T '— 2 


as (n — l) cc 
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The distribution approaches normal rapidly and for all values of (n - l) that 
would bo used in periodogram calculation certainly may be considered as normal. 
It will be very seldom that a value of (n - J) much smaller than 200 will be 
used. 

i 

The data may be printed on two strips of adding machine tape held together 
by clips so as to match data separated by a lag l. In arranging them for investi¬ 
gation, it usually is most convenient to make all numbers positive. The 
computer subtracts mentally and puts the difference into an adding machine, 
which gives him A almost immediately. 

For some computers, and especially where the numbers are large, another 
method of obtaining A may save time or lead to less numerical mistakes. The 
computer will form the sum of all his data. He will, as for the other form of 
computation, put these on two pieces of adding machine tape that he lays side 
by side. However, instead of putting the difference of the pairs into the ma¬ 
chine, he will, in each case, put in the smaller datum of the pair. Then, 

(n - l)A i = 2 2 all data - £ 1st (n — J) + £ lost (w - l ) data] 

— 2 ■,£ smaller 

The derivation of this equation is obvious. In computing by this method the 
subtotaler on the machine can be used to make the strip of sums of the first 
(w - l) data and of the last (n - l) for all values of l The first term on the 
right hand side is a constant, the last is twice the sum of the smaller numbers 
chosen in the pairs. I have computed by both methods, and where the numbers 
are small, I prefer the former. Where they are large, I prefer the latter. How¬ 
ever, when one must use comparatively untrained computers, he will find less 
mistakes made if the computer does not make the subtractions. 

The calculation of A is much shorter than that for the indices even of the 
correlation and variance periodogmms, It may, however, be shortened even 
more by a mechanical arrangement. (?i - f)Aj is the area between two histo¬ 
grams of the data matched after a lag l. These may be carefully graphed on a 
large scale and two such graphs superposed over a table with a translucent 
illuminated top. On the edge of this tabic is the track to guide a rolling pla- 
nimeter. A, as computed by this means, is accurate to approximately one-half 
of one percent of its value, a much more exact value than is needed. The 
details of such a device as constructed for the Griffith Observatory are shown by 
the accompanying photograph and diagram. The dual saving of time by the 
method and by its mechanical application have resulted in the adoption of a 
much more ambitious program of meteorological research than previously was 
contemplated. 
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Scale Diagram of Planimeteh Device 



PLANIMETETt DEVICE FOR MECHANICAL CALCULATION 
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The form taken by the periodogram is important. Consider the simplest 
case, data which follow a sine curve. 



The term in brackets takes values distributed around the circle and the part 
outside is a constant for any one lag. The bracket term sums approximately to 


—- - , since we consider all terms as of one sign only. 

TT 



If the absolute values wore not considered in the expression for A j , the periodo- 
gram would be a sine curve of period 2p. The lack of sign gives a cusp curve 
with the cusp at lags p, 2 p, etc. Such a form is advantageous in that the 
perioclogram gives sharp peaks at multiples of the periods which may exist. 

The effect of the periodogram in exaggerating the principal terms at the 
expense of the smaller ones may be obtained most easily by equating a as- 
obtained by the linear and the quadratic formulae. 
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The data may be written as the sum of cosine terms 

( 2ir! cpa\ , i f2m — 

-^) + 6cos (-— ) 


+ 4 1 ' + c, 


y< - y*-i 


2 a sin — 
V* 



2ir (2 ^ 1 

Pa 


+ ' ' ‘ + (c* —C(-i) 


2 (Vi ~ l/t- 1) 2 — 2 (n — On 2 sin 2 — + 2 (n — Ob 2 sin 2 — 4 - • • • 4 - (71 — Z) -\/2 ol 

Pa Vb 

The sine terms contribute to A 2 in proportion to tlie squares of their ampli¬ 
tudes. On account of the sin 2 — factor, they contribute very little to values 

Pi 

Ttl 

of A 1 for which — is not very closely an even multiple of r. 

Vi 


This method has been applied to rainfall data of the Pacific Coast and has 
proved as satisfactory in practice as would be expected from the simplicity 
of the theory. The periodogram of rainfall stations along the northern third 
of the California coast is shown here, exhibiting perhaps the most definite 
single piece of evidence over found for rainfall cycles. Outstanding is a cycle 
of about 45 years with its fourth harmonic as the secondary feature, The 
writer expects to publish the results of that, work in the Monthly Weather 
Review, 
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ON CERTAIN DISTRIBUTIONS DERIVED FROM THE MULTINOMIAL 

DISTRIBUTION 1 

By Solomon Kxjllback 


1. Introduction. With the multinomial distribution as a background, there 
may be derived a number of distributions which' are of interest in certain prac¬ 
tical applications. Several of these distributions are here presented and the 
theory is illustrated by specific examples. 


2. Preliminary data. In the discussion of the distributions to be considered 
there arc needed certain factorial sums whose values arc now to be derived. 
In the following discussion only positive integral values (including zero) are 
to be considered. 

There is desired the value, in terms of N , n, r, of 



fr(n t N) 



jVI 

El! X 2 \ ■ • ■ E „1 


where the summation is for all values of xi, x 2 , ■ • • ,x n such that xi + 4* • ■ * 

-\-x n = N and no x is equal to r. . 

Let us first consider the case for ?■ = 0; i.e., we desire a value for the sum in 
(2.1) for all values of X \, , ■ * *, x„ such that Xi -f x 2 -f ■ * • + x n = N and 

no x is equal to zero. By the multinomial theorem, we have that 2 


(2.2) 


(dl -(- 0,2 ' * * “h ®n) = ^ 


N I 


Ei! Xi\ ■ ■ ■ x n \ 


aV a? 


a 


Zfl 

n 


where the summation is for all values of Ei, x 2 , * ■ * , e„ such that -f- x 2 + • * * 
-{• x n — N. If a\ — Os = * ■ ■ = a n = I, then 


(2.3) 


n 




m 


Ei ~h e 2 “h ■ ■ • -j- x n = AT. 


El! X 2 I ’ ■ 1 eJ , 

The sum in (2.3) may however be rearranged into the sum of a number of 
terms as follows: 

N\ 


n 


(2-4) 


Ell X 2 \ E*!’ 

p jyi 

"^Eilsa! ■ Xn-iV 


n(n — 1) 


N I 


Xi\x 2 \ ■ • * x n ^\ 


Ei + -f * * • + Xn - N t no x - 0; 

Ei 4- + • ■ ■ + E n _! = N, no e = 0; 

, Ei + Xi -f • ■ - + Ert -2 = N, no x =s Oj 


0 


y m 

T J Ei!e 2 1 ‘ * • En-rl 1 


Ej + *2 + ’' 1 + E n „ r = N, no X = 0. 


1 Presented to the Institute of Mathematical Statistics January 2, 1036. 

1 H. S. Hall & S. R. Knight, Higher Algebra, MacMillan & Co., 4th Ed. (1924), Chap. 16. 
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Thus we may rewrite (2.3) as 

n* = /o(n, N) -f nfo(n - 1, N) 


(2.5) 


n(n - 1) ./ 

H-2l— 


- 2, M) + - • • + — r,N)+ • • ♦ 


Replacing ?t by n — 1 in (2.5) there is obtained 

' (n - 1)'» A(n-1, JV) 

(2-6 + (» - DM* - 2,N) +•■■ + (" 7 ^/.(n - r - 1, AO + ••■ 

Multiplying (2.0) by and subtracting the result from (2.5), there is obtained 
n* - n{n — l)*" = Join, N) 

{%7) a, JV)- r (r + — r — 1,N) — ••• 

Replacing n by n — 2 in (2.5) there is obtained 
, {n - 2)" - / 0 (n - 2, JV) 

(2.8) / n _ 2\ 

+ (»-2)/ 0 (»-3,tf).+ ... +^_^/o(n-r-l J iV)+ 

Multiplying (2.8) by n{n — l)/2 and adding the result to (2.8), there is obtained 

»' -n(n-l)" + n ^l (n - 2)" = /„(», W) + 

(2.9) 

Wn - 3, W) + • • ■ + ( f “ J Mn - r - 1, N) + • • • 

Continuing this process, there is finally obtained the result that 

(2.10) /,(», N) = n* - n(n - 1)" + (fl - 2)"-± it-1" 


It may be shown 3 that the right side of (2.10) is A"^ for x = 0. The author 
has elsewhere obtained (2.10), but by a special procedure not applicable to the 
general case/ 

We may readily verify (2.10) for example, for n = 3, N = 5. If + ar 2 
+ a* = 5 and no x = 0, then the sets of solutions are (3,1,1), (1,3,1), (1,1,3), 

(2,2,1), (2,1,2),(1,2,2),and/,(3,6) = »• 16 °' From ( 2 - 10 ) 

there is obtained/ 0 (3,5) *= 3 s — 3,2 5 + 3.2/2 = 150. 


* E. T. Whittaker & G. Robinson, The Calculus of Observations , Blaokie & Son Ltd. 
(1924), p, 7. 

* 8, Kullbaok, “On the Bernoulli Distribution," Bull. Am, Math. Soc,, December, 1935. 
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For the general case, we return again to (2.3) and rearrange the right side 
into the sum of a number of terms as follows: 



nN {T) 


( 2 . 12 ) 


n N - fM N ) + '^—fr(n - 1 ,N-r) 


, ft(« — l)N (2r) 1 f 0 * T 0 ^ , 
+ — 21 (rl) a - 2 > N ~ 2r ) + 


where AT'*’ = N(N - 1 )(JV - 2) • ■ • (JV - jfc + 1). 

Replacing ft by n — 1 and IV by N — r in (2.12) there is obtained 

(ft-irWr(»- l,N~r) 

(2-13) (ft _ 1)(N _ r) <r) 


+ 


r! 


/r(n ~ 2, ff - 2r) + 


Multiplying (2.13) by 
obtained 


nN^ 

rl 


and subtracting the result from (2.12), there is 


(2.14) n" - (n 
r i 


1 ) ~fr(n, N) 


n(n - l)JV ert 
21 (rl)* 


f r (n-2,N-2r)~ 


By continuing this process, in a manner similar to that used for the case r = 0 
there is finally obtained 


fr(n, N ) = *1* 

(2.16) 


nN 


(r) 


rl 


(ft - ir r + 


n(n - 1) N w 
21 (r!) 2 


(« -2) 


tf—2r 



(71 - 3)"'” 


+ 


By setting r = 0 in (2.16), there is of course obtained the value already 
found in (2,10), 

We may readily verify {2.16) for example, for n — 3, N = 5, r =s 2. If 
£i + %2 -f Ej = 5 and no 3 = 2, then the sets of solutions are (5,0,0),- (0,5,0), 
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(0,0,5), (4.1,0), (1,4,0). (1,0,4), (4,0,1), (0,1,4), (0,4,1), (3,1,1), (1,3,1), (1,1,3; 
and /j(3,5) = 3*51/51 + 6*51/4! + 3*51/31 = 93. From (2.15) there is ob¬ 
tained /j(3,5) = 3‘ - 3*5*4*2 5 /2! + 3*2«5*4*3*2/2I{2l) 2 = 93. 

The same method of procedure may be applied to evaluate 


(216) ^ n,N ^ ~ ^ailxat***^!' 


Thus, there is derived the result that 


fci + 22 + ■• • + x* = N, 
no x a* r, s, ♦ 


/r*(u 


(217) 


,,N) = n“ - n(- 

jN™(n - 2)™ , 

—2TFF— + 


JV w (n - l)"-* , N u \n - 1) 


«l 




N l,+, \n - 2) 


+ 


+ 


tf—r—i 


N w (» - ST* 

2! Cal)* 


)- 


n(n — 1)(« 


(rl)(s!) 

<w (n - 3)"-*’ 


-2)p 


3! (r\y 


or t. 


N lir+t \n - 3)^ 2r “' iV {r+z,) (Ti - 3)^"^ iV (3l) (rc - 3f- 3 ' 

T* r>i / rt »\ T oi r 


21 (r\y (»t) 


21 (H) (sl) ! 


31 (si) 1 


) 


We may readily verify (217) for example, for n = 3, N » 5, r “ 0, s = 2. 
If Xi + Xi + xt = 5 and no x = 0 or 2, then the sets of solutions are (3,1,1), 
(1,3,1), (1,1,3) and /«(3;5) = 3*51/31 = 60, From (2.17) there is obtained 
/oa(3,6) = 3 6 - 3(2" + 5*4.2 s /2) + 3*2(1/21 + 5*4/21 + 5*4*3*2/(2I) 3 ) = 60. 
It will be shown later (see section 8) that 


(a) 


/,(», A0 = /„(», AT) + ~- /,.(» - 1. N - a) 


(2.18) 


n(n - DAI 0 * 1 , , „ „ „ , , 

"* 21 (si)*' n ~ 2 ’ N ^ + 


(2.19) 


f,(n, N ) = Un, A0 + ~p /„(« — 1, AT — r) 


, n(n - 1 )N iir) r n M oN , 
21 (r!) 2 f r *( n *^ r ) H" 


From (218) and (2.19) there may be derived, by 
employed in deriving (215), that 

/,.(«, N) = /,(», W-^Mn- 1, AT - .) 

( 2 . 20 ) 

*n(*n. — 1 }N (2 

+ 2 ! W 


a method similar to that 


■) 

-/,(»- 2, JV- 2s)- 


Thia latter result also follows from (2.17 and (2.15). 


• « • 
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Let us now consider the following generalization of (2,1). There is desired 
in terms of N, n, r, a x , as , • ■ • , a n) the value of 


(2.21) F r (n, N, a lf a 2 , • • • , a n ) = 2 


N\ 


£l! Xjl • • • $ n l 


aV a? - • • a Z n 


where a x , ai, • • • , a n , are constants and the summation is for all values of 
%\, Xi, such that £i + xz -f • ■ • + x n — N and no x = r. The method 

of procedure is the same as that for the case already considered, viz when 

££l = 0-2 = • • ■ = On = 1- 

The sum in (2.2) may be rearranged into the sum of a number of terms as 
follows: 


( 2 . 22 ) 


IVI 


Xll X2! • • ■ x n \ 

N\ 


aV O2 1 • • • xi + X2 + ■ * • 4. = JVj no x = r ; 


_r 

Oi yy 

r! ^£2! • ■ • X n l 


a? ■ ■ 1 ah* + * * • + -f 


JVI 


a* 1 • ’ • 1 , 


r! " Xi\ • • • x„-i! 
x\ + Xi + * • • + •'Tn-i = A r — r, etc., no x = r; 


al-'-ai 


N I 


affi 1 ■ • • C + 




(rl)» 


Xh+l\ ■■■ x n \ 


+ 


a„-*+i • • ■ an 


N\ 


aV'- a;-£> 


(H)* ^ x\\ ■ ■ ■ En-jfel 

xi + ^ • -f x n -k - N — hr, etc., no x = r; 


For convenience, let us write 

A(n, N) = {a 1 + a* + ■ • • + Un)^ 

Ai(ll — 1, iV) = (ai + ■ ■ ■ + cn-i + Gi +1 + ■ • ■ + &n) N 

Aij(n — 2, N) = (ai + • ■ * + a^ x + <^+1 +-h + <*/+1H-b &n) N 


(2.23K 


G T (n, N) = F r (n f N, oi, 02, * * ■ , a n ) 

G T (n - 1, N, ai ) = F r (n — 1, JV, &i, to, - • • , a,_i, a,'+i, • ■ 4 , a*) 

G r (n — 2, iV, a,-,a/) = F r (n —2, AT, a,, • ■ •, a*_i, a,-+i, ■ • ■ ,aj-i,a i+lt • • ‘,a n ) 


so that (2.2) may be written as 


(0 n 


(2.24) 


4(n, N) = GXn, N) + T, a r , G r (n - 1, N - r, a,) 

T I i 

\riir) n 

+ ^OiOlf?r(ft - 2, AT - 2r, a if a f ) + 


(t ^ j, etc.) 
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From (2.24), there are obtained n equations 


(2.25) 


A,in - 1, 2V - r) = Q,(n - 1, N - r, a,) + W 

rl 


trl 


Z) a- Grin -2 t N - 2r t a ,, dj) -f 
J-l 


- l> 2, ■ ■ •, ft, jV 1) 


Multiplying (2.25) by a T iN {T) /r\ and subtracting the result from (2.24), there 
is obtained 


'« /,rAr {r) 

A[n,N) - E ~~—Ai{n - 1,AT - r) « G r {n, N) 


(2.26) 


f=l 


r! 


N 


l2ri « 

2 d T iChG r {n — 2, N — 2 r, a,-, a,) — 


21 (r!) 2 


(f ^ i, etc.). 


Continuing this procedure, there is finally obtained 


N 


<T> 


(2.27) 


Grin, N) = F r in, N } ctij (h , ■ - •, tf n ) - 4(w, JV) — 

n jy(2r) n 

Z) - l,tf - r) + 2 - 2>N - 2r) - 

f-1 ■“> V U tHl 


(s ^ jj etc.) 


Similar results are obtainable for 

( 2 . 28 ) Grt ■ • >t s f^ra<. .((ftj Nf (Jl, flu j " ' " , Oji) = ^ 




a: v l ocjl • ■ * je n l 


aV a? 2 • ■ ■ <0 


where the summation is for all values of «,• such that £j -f a?z + 
and no x - r, s, > * • , or l. 

Thus, it will be shown later (see section 8), that 


+ Xn = N, 


(2.29) 


, J\T< 8 > 11 

Grin, N) =r {x w (n, JV) H- r 2 <*SGrf(n - 1, - s, a,) 

51 j^i 

jj no » 

+ STPlJi ^ <*5aJG„(ti. - 2, N - 2s, a { , a,) + ■ • - 


(i j, etc.) 


Corresponding to the derivation of (2.27), there is obtained from (2.29) 
the fact that 


(2.30) 


71tW n 

G„{n, X) * 0,[n, N) ——r~ E *i (?,(»- 1, N - s, a.) 

SI i-I I 

^{2|> 7* 

+ o 7777 r« E flJajCMn - 2, - 2s, a if a ? ) - • • 


21 (s!) s 




ii ^ j , etc.) 


3. The problem to be studied. Consider a trial in which one of n mutually 
exclusive events may occur, with the respective probabilities of occurrence 
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Pit Pa j * *: ? pn where pi + Vn + ■ • ■ + p a — 1. The probabilities of the 
various combinations of events which are possible in N trials arc given by the 
terms of the expansion of (pi + p s + •«• >+ 

In the N trials some of the possible events may not occur, others may occur 
one, twice, etc. It is desired to study the distribution of the number of events 
which do not occur; the distribution of the number of events which occur once 
each, etc. The simultaneous distributions of the events above described are 
also to be studied. 

For example, the possible event may be the occurrence of a digit. A study 
of a sequence of random digits, in sets of ten, yielded the following three 
sample sets. 


0 

1 

2 

CO 

4 

5 

6 

7 

8 

9 

1 

0 

2 

1 

1 , 

2 

1 

0 

0 

2 

1 

1 

1 

1 

1 

1 

2 

0 

1 

1 

0 

0 

2 ; 

1 

2 

1 

2 

1 

0 

1 


Fig. 1 


In the first set three events do not occur, four occur once each, and three occur 
twice bach. In the second set one event does not occur, eight events occur once 
each, and one event occurs twice; etc. 


4. Distribution of the number of events not occurring. To obtain the distri¬ 
bution of the number of events which do not occur, there is applied to the 
expansion of (pi + pi + • * • + pn) N a procedure similar to that employed 
in section 2. 

Thus, if ir r0 represents the probability for r events not occurring, then 


7T00 


7T10 


= E 


-r 


m 


$l1 X%\ • ■ ■ x n \ 

N I 


Vl'p? •" Vn, Xi + X* + 1 1 * + S„ = 

no x = 0; 


(4.1) 


Xi\ 


V? Pa" +-h 2 


Nl 


\V X l • ' • Pn-l, 


' ' '“ad • ■ ■ a:„—i! 

xi + $a + ■ ■ ■ + a*-i = N t etc., no x - 0; 


T r o 


-s 


JV! 


N\ 


\V? Vn~r 3 


-j-—j Pm-V • * ■ + ■ ■ ■ + S “T. 

JJr+il 1 1 f Xpl X\\ ' • Xn«-r« 

»i + *»+■■• + a*-r - N) etc., no x - 0; 









134 


SOLOMON KULLBA.CK 


Employing (2.21), we may write (4.1) as 
iroa = Ffa ^t Pi j P^ ) ' ' 1 > pO 

(42) i TlQ = ^ - ^’^ 2 ' ~~ " * j P«-0 

L ir rfl - F 0 (tj. - r, N, p r+] , ■ • ■ f p„) + * ■ • + Fo{n -r,N,p i, • • • , p n , r ) 
Since pi 4- P 2 4- ■ * • -f p„ = 1 there is found from (2.27) that 

iroo = 1 - £ (1 - PiY + A £ (1 - Pi ~ Vi) N 

i-i i,i»i 

£ U - p< - Pi - Vkf + • • • 

V ■ ip? |*=1 

*■» = £ (1 - p,)* - £ (i - p.- - p/) v 

t-l i.i-l 

+ ii £ (i - p< - Vi - vi) N — 

2! 


(4.3) 


irao 


= A, {£ d - p< - p/)* - £ (i - pi - pi - VkY + • ■ ■ 

2! (i.,-1 i.M-i 


TTao = A{ £ (1 - Pi - Vi “ Pa) W “ 1 ■ *} 
dl (i.Lfc-i J 

. (i t* j, etc.) 

The factorial moments 5 of the distribution given by (4.3) are easily derived. 
The first factorial moment is given by <ri = ino + 2tt 20 + 3tt S o + ■ • • + ^ro + 1 * ■ 
find the summation of the proper terms in (4.3) yields 


(4.4) 


vi - £ (i - Vi) N 


i^l 


In general, the r-th factorial moment, given by <r r — 2 — 1) * • • 

fc«r 

(ifc - T + l)ff«iB 

(4.5) Of — 2 (1 - Pa - Pi - “ 1 - VrY, (a 7*“ b, etc.). 

a.&r < -i r—1 

Indeed, (4.3) illustrates the fact that, if /(e) is the probability that a discon¬ 
tinuous variate takes the value x, then 8 


m 


/W = 4 1 

E! jt—a 


1 J. F. Steffensen, Interpolation (1927), p. 101. 

* J. F. Bteffensen, "Factorial Moments and Discontinuous Frequency Functions" 
Skandinaviak Aktuaneiidskrifl, Vol. YI (1923), pp. 73-89. 
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The moments about any constant of the distribution given by (4.3) may bo 
derived from the factorial moments by the relation 7 

(4.7) E(x - a) T = (1 + <riA + tr 2 A 2 /21 + • • ■ + c t A t / rl).| r ft = ^a) 

where A is the difference operator of the calculus of finite differences, and 
is replaced by (—a) after the indicated operations have been performed. 

Of special interest is the case when pi — p 2 = * • • =. p„ = -, for which (4.3) 

becomes 


n 


(4-8) 



whore f a (n, N) and A"0 W are as defined in section 2. The probabilities in (4.8) 
are the respective terms of the expansion of 

For this case the r-th factorial moment becomes 
(4.9) (j f = n(n — 1) • ■ ■ (» — r + 1) (« — r) w /w v 

There is presented an example of the distribution (4,8) for the case n — N = 10. 


It is found that 




AO 10 

= 1 

A 8 0 10 = 16435440 



a 2 o 10 

= 1022 

A 7 0 10 = 29035200 

(4.10) 

H 

a 3 o 10 

= 55980 

A a 0“ - 30240000 


a 4 o'° 

= 818520 

A B 0 10 = 16329600 



A s 0 10 

' = 5103000 

A 10 0 1D - 3628800 



/ 

*00 

= .000362880 

7Tbo = .128595600 



*10 

= .016329600 

TTeo = .017188920 

(4.11) 

-1 

*20 

= .136080000 

*70 = .000671760 



*30 

= .355622400 

*80 = .000004599 


i 

*40 

= .345144240 

7T9o = .000000001 

(4.12) 


fci = 

3.480784401 

m = 3.486784401 

< 

\<T2 — 

9.663676416 

<? = 0.992795358 

1 This result iB derived aa follows : (x — a) r 

= (1 + A)*.(-a) r ; E(x - a) 


X^l 


>{x) = (X) (1 + A)- /(a:)^. (-«)r = (1 + xA + x(x - l)V/2\ + ■ • ■)/(*))’(-»)'■ For 

iv bivariate distribution it may bo shown similarly that, symbolically, E((x - a) r (y — b) 1 ) 
*= [exp(ffi. Ai + (t.i Ai)| ■ (—a) , (—!»)' where - <r mn and Ai operates only on a and A t 

opeTateH only on b. A similar result may be derived for a multivariate distribution. 

• cf.' Whittaker & Robinson, op. cit. p. 7. 
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The observed distribution was obtained by distributing 200 sets of ten digits 
each, the digits being found in Tippet's Random Sampling Numbers. 9 The 
results obtained are given in Fig. 2. Three of the 200 observed sets wore 
illustrated in section 3. 

The agreement between observed results and theoretical values is gratifying. 


5 . Distribution of the number of events which occur once each* Let tt a1 , 
represent the probability that there are h events which occur once each. Thus, 
the various probabilities, obtained by rearranging the terms of the expansion of 
(pi + j ?2 + • • • + p n y, are as follows: 


Voi =* 


tfu — 


£ rr~^ — Pi 1 * ■' P*”> ^ + * + •' ■ + = N > no .r = 1; 


%i + & + ■ 11 + aWi = N — 1, etc., no x =s lj 


Nl 


(5.1) 


m = pipa • • - pk S 


m 


vtXi •'•?* + "■+ Pn-M-i V* 




Nl 


L ^*1 In-Jfe 

—*-i Vl Pn-k, 

V\\ • v x n -h\ 

®i -b *2 4- • “ + aw* = N - fc, etc., no x - 1; 


No, of events 
not occurring 
* 

Observed 

frequently 

/ 

Theoretical 

frequency 

1 

s(x-l)/ 

Observed 

parameters 

0 

0 1 

! 0.08 

0 

0 

ci = 3,46 

1 

8 

3.26 

s 

0 

1 ci ~ 9,61 

2 

•22 

27.22 

44 

44 

x = 3.46 

3 

i 72 

71.12 

216 

432 


4 

72 

69,02 

288 

864 

Theoretical 

5 

21 

25.72 

105 

420 

Parameters 

6 

4 

3,44 

24 ' 

120 

ci = 3.49 

7 

1 

0.14 

7 

42 

va — 9.66 

8 

0 

0,00 

0 

0 

m — 3.49 

9 

0 

0.00 

0 

0 

<r 2 = 0,99 


200 

200.00 

692 

1922 - 



Fig. 2 


* L. H, C. Tippet, Random Sampling Numbers, Tracis for Computers, No. XV (1627), 
London. 
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In view of (2.21) and (2.27)j it is found that (5.1) becomes 
(ttm = 1 — N 1 — 4- ^ Y! ViViil ~ 


= 1 - £ pr(l - p<)* 1 + ^ 2 P* Pj(l - p.- - Pf)" 2 - 

Vl 41 (,}-! 


(5.2) 


l^rii - p<(i - p.r 1 - (iv -1) ^ p f p,(i - pi - p,) N 2 d- • • ■ 


WV-i)/^ „ , y _ 3 \ 

irsi = --VZ^ PiP^ 1 ~ Vi “ Pj) -'•**> 


(i ^ j, etc.) 


From (5.2) there is readily derived the fact that 
v r = W - 1) ■ • ■ (N - r + 1) 

(5.3) » 

X P«P* • 1 ■ pr(l - p 0 - Vb -- VrY (a ^ 5, etc.) 

d.&r 1 *» r«l 

For the case in which pi = pa = • ■ ■ = p n - the distribution in (5.2) 
becomes 


*n = (j) M n > if) 

ttu = Q) nNfi{n — 1, N — 1) 


(5,4) . (LV?(n r y_ 2j * _ 2) 


= Q) ("V <r ’/.(» -r,N-r) 


where fi(n, N ) and N (r) have been defined in Bection 2. For this case (5.3) 
becomes 

(5.6) <r r = n (r) N {r \n - rY~ r /n N 

Evaluation of (5.4) and (5,5) for n = N = 10 yields, 


( 6 . 6 ) 


TTor - .00811639 

x n = .27052704 

xsi — .01632960 

xu = .04794633 

X5L - .15621984 

x 9 i = .00000000 

X21 = .14082336 

X8i - .12700800 

xioi = .00030288 

xai = .21089376 

X7i = .02177280 



3.87420489 
= 13.58954496 


m = 3.87420489. 
a = 2.45428632 J 


10 For the case n = N = 10 there cannot be 0 events occurring once each, since then the 
tenth event rauBt also occur once. 
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' The observed distribution, given in Fig, 3, was obtained from the 200 sets 
previously considered. 

The agreement between the observed results and theoretical values is 
gratifying. 

6. Distribution of the number of events which occur r times each. Let 
Tr kr represent the probability that there are k events occurring r times each. 
Thus, the various probabilities, obtained by rearranging the terms of the ex¬ 
pansion of (pt + P 2 + • ■ * + p n ) N , are ns follows: 


No. of events 
occurring 
once oacn 

X 

Observed 

frequency 

f 

Theoretical 

frequency 

xf 


Observed 

parameters 

0 

1 

1.62 

0 

0 

ai ^ 3.906 

1 

10 

1 9'. 58 

10 

0 

= 14.000 

2 

30 

28.16 

60 

60 

x = 3.905 

3 

37 

42.18 

111 

222 

s 2 *= 2,656 

4 

62 

64.10 

248 

744 

Theoretical 

5 

27 

31.24 

135 

640 

Parameters 

6 

22 

26.40 

132 

660 

= 3.874 

7 

3 

4.36 

21 

126 

ffa *= 13.590 

8 

8 

3.26 

64 

448 

m= 3,874 

9 - 

0 

0.00 

0 

0 

«t 2 ~ 2.454 

10 

0 

0.08 

0 

0 



200 

199.98 

781 

2800 



Fig. 3 


( 6 . 1 ) 


ITOr =■ 


Nl 


Tlr 


= Spi 1 ■ 1 * Vn, Xi + Xa H-+ x n = N, no x = r; 


N I 


t\ • ■ ■ x n l 


V? ■ ■ * 




m 


si 4- xj -f ■ ■ - + x n -i =s N -r, etc,, no x — r; 


TTfr ~ 


VWl ‘ ‘ ■ Vk 


Nl 


(H) k 




iPfc+J ' * 1 

i Pn-fc+l " * Pn V 

T /„ i v Jt J-U 


Nl 


vV • • ■ v\-k, 


(r!) fc ^ Xi\ s„_*! 
xt + + ‘ * 1 + - N - fcr, etc., no x = r; 










DISTRIBUTIONS DERIVED FROM MULTINOMIAL DISTRIBUTION 


130 


In view of (2.21) and (2.27) it is found that (6.1) becomes 


f jU <r) * 

^ = i- J 4 r 2^(i- 2J( r r + 


( 6 . 2 ) { 


%lt = 


r I f -1 
(» 
r! 


21W 4 






SpKi-pi)^ r - -fi jb p<pI(i -p<- Pf) w ~ Jr -t—} 


JV (2r) f A 

■*=ftwiS p, ‘ m ~ p, ~ p ‘ ] ~■ 


(i ^ j, etc.) 


From (6.2) there is readily derived the fact that 
(6.3) <TJt = 7-Tr^ 2 Pap6 ■ * * Pfc(l — Pa — Vb — 1 1 ■ — Vk)^ } (a &, etc.) 

V*J a.tv.fc-l 

For v = 0,1 (6.2) and (6,3) reduce to the values previously derived. 

For the case in which pi = pi = • • • = p n = -, the distribution in (6.2) 

n 

becomes 


(6.4) 



where / r («, N ) has been defined in section 2. For this case (6.3) becomes 
(6.5) n = »**»% - k)"~ kr /n" 


7. Simultaneous distribution of the number of events not occurring, and of 
the number of events occurring once each. The ‘probabilities for tho simul¬ 
taneous occurrence of ..the various combinations of the number of events not 
occurring, and of the number of events occurring once each, are given by rear- 
ranging l the terms of the expansion of (pi + Pa + • • ■ + Pn) 1 *; and are given 
as in Fig. 4. 

In Fig. 4 none of the subscripts take on equal values simultaneously, and ffoi 
has been defined in section 2. Summation of the values in the fc-th column 
of Fig. 4, yields the probability that there are (k — 1) events not occurring. 
Comparison with (4.2) yields 

Ft(n, N, pi , pa, • • •, J>.) = Gain, N ) = Gain, N) + N £ -1, N — 1, pi) 

(7.1) 

vr(2> * 

+ -oT E Vi Violin - 2, N - 2, Vi, Pi ) + ■ ■ ■ , (t 5* 3, etc,) 

*,/-1 
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E - 1 > N > Pf> 
1^1 


1 N'£ t v&i(n-l,N-l ) Pi)N E pi<?oi 

<-i 

(n - 2, N - 1, p<, pO 


N tt> A A, 

2 ~2f .^P'P^ 01 ~2T i 1 P , 'P^ C " 

{n-2,U-%ViVi) (»- 3, AT -.2, pi t p h p k ) 




t of "■ 

o, b l >" l i,a,jS l >" ) ^—l 

p<tpt *' ■ pi(7oi(ft — r — a, 

pa, 

Pa, , ?>,) 


Fig. 4 


, Summation of the valueB in the &4h row of Fig. 4, yields the probability 
that there are (fe - 1) events occurring once each. Comparison with (5.2) 
and (2.27) yields 


Fi(n, N,p i, pa, ■ ■ ■, ?0 = Gi(n, N) = Quin,N) + E Gw(n ~ 1 , N, p t ) 

(7.2) 

+ i 2 G al (n - 2, N, pi,Vi)+ , (i ^ j, etc.) 

21 L/-1 

If we use x to represent the number of events not occurring, and y the number 
of events occurring once each, then it is found that 

E(x w y u) ) = <r n = 7/ (,) E Vapb ' ■ ■ p,(l - p a - * ■ • - p t 

(7.3) i,p-i 

- Pa - ■ • ■ ~ V P ) U ~\ (a ^ b, etc.). 

If At represents the average number of events not occurring, when there 
are fc events occurring once each, then from Fig! 4 there is found that 

E Gw. (ft ~~ 1, N, pi) + 2 E GWCn — 2, N, pc, pj)/2t 

i-l i,/-l 


3 E ^01.(71 — 3 , N, p<) p{, Pk)/Si 4 - 

(7.4) Ai-.---- 

N) -f- E Gain — 1, N , pi) 


(i 7 * j , etc.) 


+ E Goi(w “ 2, N, pi, p ^/21 + 

t.M 


■ 1 * 
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In view of (7.2), (7.4) reduces to 

(7.6) • = (g Gin, N, p,)) f Gin, N) 

A similar procedure, yields, in general 

n 

X V°V* ' ' * PkOiin - k - 1, A - fc, p a , jib, , pjfc,p<) 

(7.6) o&i ^ - 

£ 7W ■ ‘ * PkGiin - ft, A - k, p a , pi, ■ * ■, p fc ) 

n,b, ••■ h i=l 

(a 6, etc.) 

If iykQ represents the average number of events occurring once each, when 
there are k events not occurring, then from Fig. 4, there is found that 

w(s P< Gain p.) + 2 (W - 1) 


(i 7* j, etc.) 


X piVjGoi{n - 2, A - 2, p,)/2l + • ■ * f 

(7.7) m = -**=*-tt- I 

GM», A) + A X Pi -(?oi(r - 1, A - 1, p f ) ' 


+ A <a X Pi'Pi<7oi(n - 2, N - 2, p () p } )/2! 

w-i 


In view of (7.1), (7.7) reduces to 


(7.8) 


(n± P< 


G*{n -1^-1, Vi ) / Gt(n, N) 


A similar procedure, yields, in general 

TV 

N X p a Go(n~k-l,N-l,p a) Pb, *•' ,Pk,Pr) 

(7.9) ijfa = - : -■ (o b, etc.) 

X Gain - k } A, p tt , pb , ■ ■ • , Pfc) 

a, b t m ■ ■,A^l 

For tho caso in which p\ = pa = 1 * • = p n — -, as may be found from Fig. 4, 

ft 

the probability for the simultaneous occurrence of r events not occurring, and 
s events occurring once each, is given by 

(710) ■ (n) ^- fo ' (n ~ r ~ s ’ N ~ s) 

For this case (7.1), (7.2), (7.3), (7.6), and (7.9) yield respectively 


(7 n) Mn, N) = /oi(ft, A) + nNf n (n - 1, A ~ 1) + ^JjV ( 7oi(7i - 2, 

' . , A - 2) + 

(7.12) fi(n, N ) = / 0 i(n, A") + r/oi(« - 1, A) + - 2, A) + 

(7.13) <r r . = N (l) n {rU \n - r - sf-fn” 


(7.10) , 
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(7.14) Q z kl - (« - k)fi(n - k ~ 1, N - k)/fi(n - k, N - k) 

(7.15) ]#jt 0 “ N(n - k)J o(?i - k - 1, N - l)/fa(n - k, N) 

Let us consider again the case when pi =* pa = •* * = V* " ~ and n = N = 

"Rvn.1nfliinfir (7 141 nxiH (7.1 S'! hv mentis of (2.15') vields 


10. 


(7.16) 


(7.17) 


r _ 

qXqi — 

5.71 

0$H 

= 3.02 

o£'ll — 

5.21 

($81 

- 2,10 

£$21 *= 

4.51 

0^71 

= 2.00 

0^3J — 

4.10 

piei 

= 1.00 

jtfu = 

3.28 

0X»1 

- 0.00 

fi0oo ~ 

10.00 

1060 

= 1.83 

1010 “ 

8.00 

1090 

- 0.89 

| 1^20 " 

6.16 

1070 

- 0.27 

1030 = 

4.50 

1080 

- 0.02 

J0oo = 

3.05 1 

1000 

= 0.00 



■ 

Number of eventa not occurring 

— i _ 


1 

0 

1 

2 

3 

4 

5 

6 

D 

& 

9 

1 

1 $ 


0 






■ 

■ 

1 



1 

7.00 

■3 

1 



1 


1 

1 6 

3 

1 





c3 

4) 

0) 

2 

! 




16 

13 

,1 

1 





§ 

hO 

3 





35 

2 





37 

4.05 


4 




42 

20 






62 

3.32 

« 

O 

gig® 




'27 







27 

3.00 

CO 

13 

g 

Kl 



19 

3 




I 




2.14 

£ 

o 




3 ' 





1 



1 

3 

2.00 

u 

V 

■s 

8 


8 









8 

1.00 

1 

9 











0 



H 


- 


1 







0 




o 

8 

22 

72 

72 

21 

4 

1 


0 

200 



0 

L 


6.16 

4.46 


1.81 

1.25 


-<r- 

r 

1 



Fio, 5 
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The distribution in Fig. 5 yields d n = 11.89, (7.13) yields c n = 12.07959552. 

The agreement between the observed results in Fig. 5 and the theoretical 
values in (7.16) and (7.17) is gratifying. 

8. Simultaneous distribution of the number of events which occur r times 
each, and of the number of events which occur s times each. The probabilities 
for the simultaneous occurrence of the various combinations of the number of 
events which occur r times each, and of the number of events which occur s 
times each, are obtained by rearranging the terms of the expansion of (p t -f p% 
+ T ' ■ + Pn) Y . If TTfcr.u is the probability for the simultaneous occurrence of 
h events which occur r times each and l events which occur s times each, then 

(kr+h) n 

= M ?l /-i\i ^^ V a ' " * P^Ptf * ' Px^?rj 

(8.1) kill (r!) (s!) — fc x=i 

(n-k-l,N -kr-ls t p a ,'- - ) 'Pk,V*>"', Px), (« ^ b, etc.) 

where G„ is defined in section 2. 

From (8.1) and (6-2), there is derived, in a manner similar to the derivation 
of (7,1) and (7.2), the result that 


F r (n, N,pi r - - • ,p») = G r (n, N ) = G r s(n,N) -|- —p X)pi G rs {n - 1 ,N-s,pi) 

S\ f=i 


( 8 . 2 ) 


r( 2 «> ti 


+ .S $ ti G "( 71 - N ~ 2s » Vi . Vi) 4-, ii ^ j, etc.) 


and a similar result by interchanging r and s in (8.2), 
For the distribution given by (8.1), it is found that 


rMit-Ha) 


(8.3) 


crjti = 


Pa • ’ ■ PkVa 1 '■ * PX 


(1 “ p a - ■ • • - pn — p a - * • ■ - V^ kT ' l % 


{a 7 * b, etc.) 


If represents the average number of events which occur r times each 
when there are I events which occur s times each, then from (8.1) and (8.2), 
in a manner similar to the derivation of (7.6), it is found that 


r&l a — 


(N - ls)' T) £ tiaVa ’ ' • PxG,(?t - 1 - l t N ~ r - is, pa, pa, ’ 1 • , ps) 

a,a, 1 * • ,\™1 


(8.4) 


r\ Yi Pa - ’ ■ -l,N - Is, p 0 


,Px) 

(a ft etc.) 
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If represents the average number of events which occur s times each 
when there are k events which occur r times each, then by interchanging k and l, 
and t and s in (8.4), there is found 


(N-kr) {,] 2 v't"'Plv*GT(n-'k-l,N-kr-s,p a ,"-,$!<,$*) 

a ( '''.fc.e—l _ __ 




( 8 . 6 ) 


S Pa ■ •' PkQM ~k,N — hr, fa,-“ , pt) 


(i a b, etc.) 


For the case when Pi = P 2 “ ■ ■ • « p n = it is found that (8.1), (8.2), 
(8.3), (8.4), and (8.5) respectively yield 


(8.6) 


n ' M ~ Q 


m (rl) 1 (si) 


j f ( „(n - k - l t N - hr -U) 


nN 


(ff) 


(8.7) 


/,(», JO = /„(», JO + fM -1, N -«) 


, n(n - m tu> 0 „ ,. , 

+ -m - ar /"V* - 2, J/ - 2s) + 


2! (si)* 

(8.8) »h = n W) N tiri,, \n - it - J)"-^ ,, /(rl) t (a!) 1 n“ 

(8.9) A a (re - 7)<1V - is) (,) /.(« - 1 - !, JV - r - W/r!/.(n — i, JV — i») 

(8.10) ,ff», = (« - 4)(J7 - kr) M f,(n - k - 1, N - kr - s)/sl/,(n -k,N-ti) 

For r = 0, s = 1, the results derived in this section of course reduce to those 
already derived in section 7. 


9. Conclusion. It is clear that the same method of procedure may be em- 
ployed to study the simultaneous distribution of the number of events which 
occur r s s, ••• , t, times each. However we will not continue the discussion 
any further. 

We have thus seen that the multinomial distribution serves as the back¬ 
ground for the study of a number of distributions which have certain practical 
applications, 

The theory discussed herein has been illustrated by several examples which 
yielded gratifying agreement between observed and theoretical results. 


Washington, D, C. 
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A PROBLEM IN LEAST SQUARES 
By Jan K. Wisniewski 

51. We are dealing with two variables, the observed values of which are 
denoted x and y respectively. The pairs of observations are divided into r 
groups, numbering n\, , • * • n r pairs. Suppose in each group we determine a 

regression equation of the following shape: 


= di + + • * ■ rmx 1 ( 1 ) 

where y+ denotes the value of the "dependent" variable obtained from the 
regression equation, while y without any subscript denotes its observed value, 
The r regression equations of type (1) are not assumed independent; on the 
contrary, we postulate that 

r 

E Vi a* Oq 4- M + ' 1 • WflX 1 (2) 

1 


be fulfilled identically in 3 ; oo, to, • • • fflo being predetermined numbers. This 
leads to the following conditions: 


r r r 

2 = Oo 2 hi = 6 0 • ■■ E Mi = Wo- 

1 1 1 


(3) 


The magnitude to be minimized under the theory of least squares is now 

I 

1 


z = E Hily - ( a < + M Wii 1 )! 2 + Er|j/ - (*> - 

The normal equations derived from (4) are of the following shape: 


12 


(4) 


4 " 1 ■ ■ Tflj ® 


U}Or) 4 “ Hr E fl i + E/ £ 4 “ ^E btj (Er x) 

+ (E (Sr «') = E iV - Er V + 4 - &0 Er « + ' ' 1 Wc Sr 


( 5 ) 
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flj Hi x + a,^) (Hr a) + h Hi a* + bt} (Hr a 2 ) 

Hi Z ,+1 + (H Wi) (Hr a ,+1 ) = Hi mj - Hr xy+ «0 Sr a 


-{- •' * w>i 


X 

+ bo Hr a* + ■ • ‘ Wo Sr a ,+l 


(5) 


«, Z,- + (t, a<) (L, x') + i< E, + (E &<) (E. O 
+ •••)«,• Zi ®" + (z mi) (Zr * ! ‘) = Z( ®V — Zr 

“l - do S "* a 4^ ^0 ^ -/ r 55 4“ * * ' Wo X 


Hi meaning a summation extended over the f-th group. As (1) is of the 
a-th degree, we have (s 4- 1) (?' — 1) parameters to determine and as many 
equations, the problem thus being in theory solved.* As to the numerical 
solution, Doolittle's method or any other may be applied. We do not enter 
at present the question, how much labor would the actual solution require. 

Examples. Allen and Bowley in their book on "Family Expenditure” 
(London, 1935) assume the expenditure on some defined item / to be a linear 
function of the total expenditure e 

f = he 4* c< (6) 

Evidently Sfc *= 1, Ec ® 0 (cfr. pp. 10-11). Another example I give in a 
paper on seasonal variation, which appeared in "Economic Studies” III 
(ICrakdw). , Actual values y of a time series are assumed to be linear functions 
of certain "normal” values x 

V - a 4- bx (7) 

a and & changing from month to month but constant from year to year. Then 

H & - 0 , H & * 12 . 

§2. Methods of solution in special cases. The generally recognized methods 
of solving normal equations become extremely laborious as the product (s 4- 1) 
(r — 1) grows large. As a matter of fact, the amount of computer's work is 
approximately proportional to the cube of the number of parameters to deter¬ 
mine. Therefore short cuts seem to be indispensable, A most elegant one is 
at our disposal in the special case 1 when the values of x in the several groups 

* Tho remaining $ + 1 parameters a r , b r , - ■ • m are, of course, found from (3). 

’ This seems to be realized in Allen and Bowley J s work,. 
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are identical, or, at least, the sums m, £< a;, x\ ■ • - £,• x 2t are identical 
in i. Instead of (1) we shall write 

y< = A* + BiXi + • * ■ MX, (8) 

where Xi , Xi , ■ ■ ■ X, are orthogonal polynomials, i.e, such that, 2 X t Xj ~ 0 
if and only if i ^ j. In general, X h = X h ,+ aj_i X h ~ l -f • • * , the coefficients 

being rational functions of n, 2 a, 2 ® 2 } * ’ 1 £ a 2 ' -1 . 

The conditions (3) can now be replaced by a set of equivalent ones, viz. 

’t,A l = A, (9) 

1 L 1 

How the actual values of Ao t are found, will be shown in the next 

paragraph. The solution becomes now very easy, as the normal equations 
for the determination of each set of r — 1 parameters are independent, i.e. we 
can calculate the A’s separately, then the B's etc^, the order of solution being 
of no importance, Moreover the shape of the normal equations permits of 
considerable simplification of solution. Suppose we have to determine the 
values, of the coefficients K, corresponding to Xh . The normal equations are 
now—after certain simplifications— 

2tK\ Ki + Kz + ' ’ ’ &r -1 = Xhy — ■ Xhy ) + ffo 

Ki + 2ifi + K, + ••• K r .1 = — (Z* x„y - E, X,y) + I(„ (w) 


K x -|- -|- K$ ~h ■ f ■ 2K r -i 


Zxl 


(E- Er X,y) + K„ 


Adding these equations, dividing the sum by r and substracting the quotient 
from the yt h equation, we get 



i / v ^ XhV 

Av ~zxl 



(ii) 


The first member of the right hand side of (11) should be regarded as the 
principal term: this is actually the value we would obtain for K } -, were this 
coefficient independent from the other iTs. The second member is a correction 
term, the necessary amount of correction being distributed equally among the 
several K’s. The simple solution given by (11) is only possible if the sum 
2 Xi is the same for each group. From the definition of X h we see that it 
is equivalent to saying that Hi, £ 2 , • ■ • ^ be identical in i. As 

h increases to s, wo come to the condition given at the beginning of this para-* 
graph. 





148 


JAN K. WISNIEWSKI 


§3, If this condition is not fulfilled, we can, indeed, replace the power series 
in a; by orthogonal polynomials the second subscript being appended 
in order to show that the values of the X polynomials are no more identical 
for the several groups; these polynomials are now orthogonalized separately 
within each group, But we are no more able to predetermine the values of 
A 0 , Bo, ■ • ■ M 0 , os they depend on each other; this will be made clear a little 
later. Therefore we have to resort to an approximation: the values of the 
parameters will not be found from simultaneous equations, but successively, 
step by step, beginning with those corresponding to the highest degree of the 
independent variable, 

The values of (h , fc Q , • ■ * mo are given. It is evident that iuq - Mq , The 
j-th normal equation is now: 

Mi'Ll xl, - M,T,,XU + (E (ZrXl,) - 'LiX.. a -'L,X.. ( y. (12) 


We see at once that 


Mi- 


+ T*x..<h - ZtX.a 




Inserting this into /12/ we get 

/Cj X t ,jy 




M f = 


sr\ ZjiAhJ/ _ 

1 i Si-Xl-i 


Mo 




X 1 ZaU 

i 


(IS) 


(14) 


The second member of the right hand side of /I4/ is again a correction term, 
the necessary amount of correction being distributed in inverse proportion to 
Now we determine the value of L a , this coefficient corresponding 
to s — 1, the second highest degree of x, and calculate the several L’s from 
equations strictly analogous to (14) thus accomplishing the second step of our 
work, and so on, down to the ri's. L 0 is found from the following equation: 

h = u-t, [*:_.« • Af>]. (is) 

i 

To ota is now appended a bracketed i, this to stress its variation irom group 
to group. We see from (15) that before the several M's are calculated we are 
not in a position to determine La. On the other hand, if a'_t is the same for 
all groups, the second member of the right hand side of (15) simply reduces 
to «!_r«io and Lo can be determined in advance, i.e, before calculating the 
M’s. This is the case treated first (in §2). In any case, if no definite corre¬ 
lation is to be expected between (i) and Mi, the approximative method 
developed here should give very nearly correct results. The writer applied 
this method of solution to the simple problem of seasonal variation mentioned 
in §1 and found the results very satisfactory. 



A SIGNIFICANCE TEST FOR COMPONENT ANALYSIS 

By Paul G, Hoel 
1, Introduction 

During the last few years several papers and books have been written on 
various aspects of what has been termed component or factor analysis. This 
analysis has arisen from the psychological problem of describing the results on a 
series of tests in terms of a few distinct abilities or components. In much of 
such work it is claimed that there does not exist more than a certain number 
of components, the material discarded in order to substantiate such a claim 
being considered as due to random errors of sampling or errors of measurement. 
However, mere inspection of results or the calculation of standard errors of 
residual correlations is hardly sufficient to justify such conclusions, and there¬ 
fore a significance test of some kind is necessary. Hotelling 1 considered such 
a test but based it upon an uncertain analogy with the analysis of variance 
and upon the legitimacy of using standard errors. The purpose of this paper 
is to derive a test which is more general in scope and in which all assumptions 
are explicitly stated. 

If each test score is thought of as being made up of two parts, a true score 
and an error element, the assumption that there exists fewer components than 
the number of tests implies that the scatter diagram of the true scores will lie 
in a space of correspondingly smaller dimensionality. Consequently, an ideal 
test for the number of components would be one which would test the rank 
of the true moment matrix. In the case of normally distributed variables, 
this line of approach leads one to the sampling distribution of the generalized 
variance. Unfortunately, this distribution appears in unintegrated form; how¬ 
ever, by considering its moments it is possible to find a good approximation 
to this exact distribution for samples which are not too small. 

The paper proceeds by first finding two approximation distributions for the 
generalized variance, one for samples which are not too small and one for large 
samples. It then considers the type of population from which it will be assumed 
the sample was drawn, and finally applies the test to two numerical examples 
from recent literature along such lines. 

2 , Approximation Distributions 

Suppose that N individuals have been drawn at random from an n variate 
normal population whose distribution is expressed by 

*■ 

“2 

(1) P(xi, X2, • •* , Zn) = Ke 

1 Harold Hotelling, Analysis of a Complex of Statistical Variables into Principal Com- 
' ponentfl, The Journal of Educational Psychology, September and Ootober, 1933, pp, 21-25, 
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where .r,- = Xi — m lt An = 


A,- 




2(Tj tfj A 


, A is the determinant | py | and An is the 


cofactor of in A, and K = [ | 1 /(2ir)' l/a . If the observed values of the 

variables of the o-th individual are denoted by Xi„(i = I, 2, • • - , n) } then the 

generalized sample variance is defined as z = | | , where an = Yl {Xia — X») 

(Xj a — Xj). Wilks 2 has shown that in sampling from the population (1), 
the fcth moment of the sampling distribution of z is given by 

JN + 2 fc - 

"d-3 


J N + 2 k - 1 } r (N + 2k - 2' 


Jf * - A 


-It 


-) 




N- 2^ 


fnP) 


where A = iV" j An j. An inspection of the integrated form of the distribution 
of z in the case of n = 1 nnrl n = 2 suggests that there likely exists a function 
of similar form for higher values of n whose Jcth moment can be made to differ 
from ilfj; only in higher powers of terms which contain N 1 as a factor. An 
.investigation along such lines leads to the function 


( 2 ) 


where C = 


y-n n ,v-n _ i 

2 2 
a n 




N-n' 


g(z) = Cee*^ n 

»w=- H - ,a = Ag and ? — 1 — -- 


2 N 


It will be shown that the kth moment Ml of g{&) differs from M k only in terms 
of magnitude less than the second and higher powers of U l n/N or kn/N. 

Multiplying g{z) by 1 and integrating over the entire range of z will yield 
Mk , which turns out to be 


r (' 

Ml= A. 


N — n + 2ir 


a v* r (»*_?) 


Upon reducing the upper gamma function and performing successive steps of 
simple algebra 


/ -k-nt X — n + 2 k 


M k = (C K n 




= N nk aT k 2-" k (l -f 




N - n 


2fc — n- — 2 Jn 

T~ 


)( i+ 


2k — n — 4 fri 


N 


( 




1 1 2k — - n — 2k?}./n \ 
+ W )' 


8 S. S. Wilka, Certain Generalizations in the Analysis of Variance, Bioinctrika, Vol, 
XXIV, 1923, p. 477. 
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The terms in parentheses may be treated as the factored form of a polynomial 

of the nfcth degree in unity* Thus the quantities — ~ t etc., may be 

treated as the zeros with signs changed of the corresponding polynomial in 
x (say). As a result, the successive terms after the first in the non-factored 
form of this polynomial in unity are the sums of the products of these quantities 
taken one at a time, two at a time, etc. Upon performing this multiplication 
and letting = N n /2*A, M k assumes the form 

f 

where the neglected terms are in magnitude less than the second and higher 
powers of k?n/N or kn/N . If M k is handled in exactly the samo manner, it 
will be found that 


Mk = A 


f + ?~ 1 - 1 )-( y+ ?- 1 -*)- 

f N + U-n I y../ y + 


ink A-kci-nk( * . 3 


= .nri + 


N 


if. nk(n — 2fc + 3) 


^ 1 


2 N 


+ 


H) 
] 


L 2k -n - 2\ /. n\ 

+ 


where the neglected terms are of the same order of magnitude as those neglected 
in the approximation to Ml . Before a comparison of Mk and Mk is possible, 
the factor q~ k of Ml must be expanded and multiplied into the quantity in 
brackets. This operation yields the result 


Mi = </>* 1 - 


nk(n — 2fc + 3) 
2N 


4* 


* 


\ 


Thus Mk and Ml agree to within neglected terms. As a matter of fact, if 
the values of the neglected terms are'considered more carefully, it will be found 
that the actual difference between M k and Ml is considerably less than the 
given upper bound for the magnitude of neglected terms would indicate, For 
example, when n = 5 the first term in the difference is 6fc(/c — .9 )N ", while 
G25fc 2 iV~ 2 or 26ft 4 iV~ 2 is the upper bound for this term when only general results 
are used. The general formula for the first term in this difference has been 
obtained, but since the remaining terms have not been investigated and since 
the type of problems to which the distribution p(z) is to be applied docs not 
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justify this refinement, it tv ill not be considered here. Consequently, if one 
considers this distribution function as sufficiently determined by its low order 
moments and if one applies g(z) only to problems in which N is fairly large 
compared with n 2 , then the function g(z) will give a good approximation to the 
exact sampling distribution of z . Obviously, g{z) is identical with the exact 
distribution for the known cases of n — 1 and n = 2. It is not possible under 
the above expansions to vary the constants in the form of g(z) in such a manner 
as to obtain an approximation whose hth moment will agree with ilf* to within 
still higher powers of comparable terms. 

In order to test whether or not a sample value z » Z can be reasonably 
assumed to have been obtained in random sampling from a population of type 
(1) with fixed A, it is necessary to calculate the probability P of obtaining in 
repeated samples a value of z greater than Z. Thus it is necessary to evalua te 


P = 1 - jf g{z) dz. 


N — n 

Upon making the substitution x — n\/az, and letting p = n ——— 1 and 

u = = nN /^/1 1 - ( —IMN - 


«■)] this 


integral can bo reduced to the standard form of the incomplete gamma function. 
Hence P assumes the form 


(3) P = 1 - I(u, p ) 

where 

1 f uVp+1 

In many applications of this distribution it will be found that the values of 
u and p lie beyond the tabled 3 values of these constants. Consequently, it 
will often be sufficient to use the normal distribution to which the gamma 
distribution tends as N becomes large. This normal distribution will be 
considered next. 

Rather than obtain a normal approximation to giz) or the gamma function 
to which g{z) reduces after the above transformation, it is more illuminating 
to find the basic descriptive parameters of the exact distribution of z and from 
them obtain a normal approximation. Such a procedure will show how rapidly 
the distribution of z approaches normality with increasing iV. By using the 
recurrence formula connecting M k +i and M k , which can be found directly from 
the ratio of these two moments, and expressing the necessary moments in 


* K. Pearson, Tables of the Incomplete Gamma Function, Biometric Laboratory (1922), 
Univ, of London. 
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terms of M \, it can, be shown that these basic descriptive parameters are expres¬ 
sible in expanded form as follows: 


n(n + 1) n(n + l)(n - l)(3n + 2) , 

OAT "1 nA rn r " ‘ 


m 0 [ X 2JV ' 24 N* 

ff 8 = ^['- 2 iL_i ^ a -^ + 1 ) + ... 1 
9 L N N* + J 


ft = 


2(3n - 1) 

nN 


2 r 


_ (n + l)(5?i - 3) , 
1 - -- f 


2(3?i - 1)N 


ft 


oTi , 4(3» - 1)(4« - 1) , 
= d [ i H- ZZT? -b 


3 nN 




These values suggest that 


will likely be distributed approximately normally with zero mean and unit 
variance. As a matter of fact, by using the second limit theorem of probability, 4 
it can be shown that the distribution of w approaches normality as N increases 
indefinitely. Hence, for samples in which N is large compared with n 2 , it 
will be sufficient to compare the value of w arising from a sample z — Z with 
its variance of unity if a test of significance is desired. A better general ap¬ 
proximation could have been obtained by centering the curve at 4> 

rather than at <p] however, since there is positive skewness anc 
lies between these two values, there might arise some exaggeration in a signifi¬ 
cance test in doing so because the accuracy of such a test depends upon the 
accuracy of the approximation in the right hand tail of the curve. 

Inspection of (3) and (4) shows that the only population parameter upon 
which these approximation distributions depend is <f>. There are no assump¬ 
tions necessary about the population means, or variances, or covariances, 
except in so far as they may be related when the value of 4> is postulated. This 
means that either (3) or (4) enables one to test whether or not it is reasonable 
to assume that the sample variance z — Z arose in random sampling from some 
normal population with <f> equal to the postulated value, 


n(n +1)1 

21V J 

L the true mean 


3. Population Assumptions 


Consider the set of variables u\, u 2 , ■ ■ ■ , u n distributed according to the 
normal law 

fl 

“S 

(5) P(wij Uz f * * • f Wn) = K\G 

* See, for example, Trechet and Sliohat, A Proof of the Generalized Second Limit 
Theorem in'the Theory of Probability, Transactions of the American Mathematical So¬ 
ciety, Vol. 33, (1031), p. 533. 
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and the set of variables v\ , , • ■ ■ , v n distributed according to the normal law 

n a 

-2 c <*c 

(6) P(v i, V 2 , ■ •' f v„) = IQe 1 

where the it's are uncorrelated with the w's and with each other. The joint 
distribution of the u*b and ti’s is expressed by 

n * l 

-2 b|/B, v ( 

(?) P(wi, ■ * • , v n ) = IUe 1 1 

f 

Upon writing down the determinant of the coefficients of those 2n variables, 
it will become evident that any one of its principal minors of any order can be 
expressed as the product of a principal minor of [ bn ( with a principal minor of 
| a \ . Since the distributions (5) and (6) are normal, the determinants | 6,- 7 -1 
and | C{ | are positive definite; consequently the determinant of the coefficients 
in (7) must also be positive definite. 

Now consider the orthogonal transformation 


Vi = 


Ui + Vi 

V2 ’ 


t=l,2. 


n 


U { - Vi . ... _ 

Vi - — 7 =-, i = n + 1, 1 • *, 2 n. 

V 2 

Since the determinant of the coefficients in (7) is invariant under an orthogonal 
transformation, the resulting distribution of the y 's may be expressed by 

In 

-h dijl/iVi 

w Piy i ,vu ,y*n) = K*e 1 

where | da | is positive definite. 

In order to obtain the distribution of the variables t/i, f/ 2 , ■ ■ • , y n , it is 
necessary to integrate (8) with respect to the variables y n + i , ■ ■ ■ , yin over 
their range of values. If this integration is performed after the quadratic form 
in the exponent of (8) has been expressed as a sum of squares 6 with coefficients 
which are the ratios of principal minors of | da |, it will be clear that the inte¬ 
gration leaves a quadratic form in the exponent which is also positive definite. 
Hence after the transformation = \/2yi{i = 1, 2, - - ■ , ri) the distribution, 
function of the variables a- m + v t (i = 1, 2, * ■ • , ri) must be normal and 
may be expressed by (1). Thus it has been shown that if the true parts m 
of the variables #,• are normally distributed without error and if the error parts 
ii{ are normally distributed but are uncorrelated with the Ui and with each 
other, then the variables Xi possess a normal distribution, The advantage of 


s See, for example, Risacr and Traynard, Leg Prineipea do la Stafciatique Mathematique, 
1933, p. 226. 
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this formulation will become evident when the parameter 0 is expressed in 
terms of the parameters of (6) and (6). 

Since the ris are uncorrelated with the u's and with each other, the variance 
a“i of Xi is the sum of the variances of and V{, while the correlation pa be¬ 
tween Xi and Zj may be expressed in terms of the correlation p'a between u { 
and U/ and the variances u* , u) t v*, v\ of w,-, Uj, vt, v ,• respectively. These 
relationships are 


(9) ffj = ju5 + n , and Pij = 


r 

PH 


+ 4 / m !)( 1 + vf/f$) 


(i 3* j)- 


For simplicity of notation let X» = . Now it is well known® that 0 can 

be expressed in the form 


^ 2 2 

0 = ffi ff 2 ■ ■ • ffn Pij 


If the values from (9) are inserted in [ p,-y | and if the resulting denominators 
of elements are factored out, 0 will assume the form 

S 2 ‘l -n 

(TiffS ' 1 * ff„ .D 


0 = 


where 


B = 


(1 -J- Xl) * * * (1 *f- X n ) 


1 4~ Xi P 12 Pin 
! 

Pl2 


/ 

Pl7» 


1 +X„ 


Following the methods of confluence analysis, 7 B can be expressed as follows: 


n n 

B = It 4" H { -f- XftXp f2) a p( 4* • 1 * -j- X 1 X 2 1 1 ’ X n 

where R = | p'y (, R )a (_ is the principal minor of R obtained by deleting row 
and column a, etc. R is the true correlation determinant whose rank it is the 
object of this paper to test. If R is assumed to be of rank n. — t, then all 
principal minors containing more than n — i rows vanish and B reduces to 

n 

B “ ^ j Xu, Xflrj ’ * Xdi j > ■ a 1 ( 4" " ’ ' 4“ X 1 X 2 ’ ’ * Xn * 

The tests (3) and (4) were designed to test hypothetical values of 0 by means 
of the sample Z. Evidently the value of 0 can be postulated by assigning 
hypothetical values to the X’s, the <ris, and the principal minors of R, 
Assigning values to the X's does not curtail the degrees of freedom in these 


1 S. S, Wilks, loc. cit., p. 477. 

7 Ragnar Frisch, Statistical Confluence Analysis by Means of Complete Regression 
Systems, Oslo, 1934. 
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testa because they were derived on the basis of (1) "which depends only on the 
m’s, o-'s, and p's. The X f s do restrict the range of the p’s, but not their degrees 
of freedom. 

An inspection of the expression for 4> shows that 0 can be made to assume 
any desired value irregardless of the rank of R by merely assigning the o's 
properly. It is therefore necessary to make some assumption regarding the 
tr\s if the test is to serve the purpose foT which it is intended. Here it will be 
sufficient to assume that the product of the population variances may be re¬ 
placed by the product of the sample variances. This assumption will ordinarily 
be approximately, fulfilled for the size samples for which it is legitimate to 
employ (3) or (4); consequently this assumption does not restrict the range of 
application of the test. 

To postulate values of the principal minors of ft beyond postulating the rank 
of ft would introduce hypotheses and restrictions which are irrelevant to the 
fundamental purpose of the test. This difficulty will be avoided by replacing 
all non-vanishing minors of ft by their upper bounds of unity. Since this 
will overestimate the value of B, and hence of <p, tlie usual significance level of 
.05 may be considered as decisive. Let the value of B when unity is inserted 
for all non-vanishing principal minors be denoted by V, Tlien 

n 

(10) D = S kajXflj * ' 1 Xor ( 1 1 * X 1 X 2 ' ' * 

Sinc^ 

n < n ’ ft 

XI (1 -b \<) ~ 1 “h ^ X a 4“ ^ X a , \ oa 4- * * ‘ *V kiXa ’ *' X n 

1 tt-l «1<BJ 

It will often, be convenient to write D in the form 

(11) D = IX(1 -b \t) — il ~b 2k a + ■ • \ + S X 01 X ni • • * Xa,-, 

1 l *“i <■■■<« (—1 

As a consequence of all the above assumptions, 



( 12 ) 


Z — f a 'i I _ (l ~b Xi) • •' (1 ~b X n ) I Ti± 
0 ~ <t> B 

>> (1 ~b Xl) * * * (1 + Xn) I Tij | 

D 


where | r ( j | is the sample correlation determinant. 

All the essential material for testing the rank of the true correlation matrix 
is contained in (3), (4), (11), and (12), In summary, the hypothesis to be tested 
and the procedure to follow in performing the test are as follows. 

The population of n variables from which the sample is supposed drawn is 
assumed to be such that (a) the true parts of the variables are normally dis¬ 
tributed, (b) the error parts are normally distributed but are uncorrelated 
with the true parts and with each other, ( 0 ) the product of the variances may 
be replaced by the product of the sample variances, (d) the values of the X’s 


\ 
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are postulated as judged by the accuracy in measurement of the variables, and 
(e) the rank of the true correlation matrix is n — l. 

Given,the value | r<,-1 of the sample correlation determinant, a lower bound 
for the value of Z($ is calculated from (11) and (12), This lower bound is 
.inserted in either (3) or (4), depending on the size of the sample. If (3) is 
used and if P ^ .05, or if (4) is used and w ^ 2, one may conclude, as judged 
by the sample variance, that it is very unlikely that the sample was drawn in 
random sampling from the population specified abo've. If one has reason to 
believe that the variables are sensibly normal as indicated above and that the 
postulated values of the X’s are quite accurate, then the test shows quite defi¬ 
nitely that the postulated rank of the true correlation matrix js unsubstantiated 
by the sample, and therefore a higher rank should be tested until a non-signifi¬ 
cant value is obtained. Because a lower bound rather than the value of 
is used, the test can be used on minimum ranks only, and hence a value of 
Z < <j> will not yield a test of significance. However, the test does handle the 
problem for which it was designed and which is of fundamental interest, and 
that is to see whether or not one is justified in assuming that a sample repre¬ 
sents only a certain minimum number of components. 


4. Applications 

(a) Hotelling 6 has used an example taken from other sources to illustrate 
his test on components. In order to compare results, this same example will 
be treated here under the assumptions outlined above. In this example the 
reliability coefficients are given. From the definition of a reliability coefficient 

Ti , it follows at once that r* = -—J-r-. The population values of the X's will 

1 “J" A| 

be set equal to the,values obtained from these sample reliability coefficients. 
The data for this problem are 


r f/ | = .236, N = 140, n = 4, Xi ~ .087, X 2 = .119, h 


.101, *4 = .773. 


Assume that the true correlation matrix in the population is of rank two, that 
is, that two components are sufficient to describe the results on these testa. 
Since N is large compared with w 2 , it will be sufficient to use (4). The values 
of (11), (12), and (4) are found to be 




n (i "f* X() i Tij i _, 
D 


90 


~ [1.90 - 1] = 3.76 

O 


8 Log, cit., p. 16, 
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Since the standard deviation, of w is unity, this value demonstrates clearly 
that the hypothesis of only two components is untenable as judged by the 
sample correlation determinant. If one assumes throe components, the test 
will be found to yield a non-significant value. Hence it may be concluded that 
under the hypotheses on which the test is based, the sample does not justify, 
the assumption of less than three components. Hotelling's test indicated the 
necessity for two components but was uncertain about the third, the decision 
resting upon a variate value of 1.31 as against a standard deviation of unity. 

(b) Thurstone, in his "Vectors of Mind," considers an example taken from a 
series of fifteen psychological tests- After applying his centroid method to the 
data, he inspects his results and concludes that four components are sufficient 
to account for everything except random errors. It is impossible to test his 
conclusions explicitly as above because the size of the sample is not given and 
the reliability coefficients are not known. Nevertheless, if it is legitimate to 
assume that the sample is sufficiently large to justify the use of this test, in¬ 
teresting conclusions can be obtained on the assumption that only four com¬ 
ponents are needed. 

Suppose that X* = which implies that the variance of error is half as large 
as the true sampling variance for each variable. Here (10) is more convenient 
than (11) for computing the value of D. The values of (10) and (12) are 
found to be 

D = *m) u + i S C 2 (*) J3 + uC&) u + qf - .126 

7 > kd 

<p 5=3 .0003* 

Evidently, the value of | r,,- [ must lie in the neighborhood of .0003 if the test 
is not to yield a significant result which contradicts the hypothesis. However, 
the correlations in | r i3 ( are given to only three decimal places, and therefore 
a legitimate value in the neighborhood of .0003 cau not be realised. It Is to be 
noted that the postulated values of the X's are equivalent to postulating that 
all reliability coefficients are equal to f, a value which should be considered as 
unusually low, It would seem reasonable to avoid using material in which the 
variance of error is larger than one-half the variance of random sampling, unless 
the variance of random sampling is exceedingly small. 



CONTRIBUTIONS TO THE THEORY OF COMPARATIVE STATISTICAL 
ANALYSIS. I. FUNDAMENTAL THEOREMS OF 
COMPARATIVE ANALYSIS 1 

By William G. Madow 

This is the first of several papers in which there will be presented a general 
approach to the statistical examination of hypotheses which arc false if any of 
several things are true. Phenomena requiring such a statistical theory are 
investigated quite frequently. As examples may be cited the studies of lag 
correlation in time series, periodogram analysis in geophysics, factor analysis 
in psychology, and analysis into components in agriculture. 2 

The theorems of this paper have one purpose: to permit the reduction of the 
distributions by which the hypotheses are to be tested to essentially the joint 
distribution of the statistics which contain the information offered by the data 
concerning the truth or falsity of the things which will negate the hypotheses. 
In order to do this it has been necessary to generalise the theorem of Poincare 
on the probability that at least one of several events occur, a As illustrations 
there are stated, after Theorems III, VI, and IX, generalizations of a distribu¬ 
tion derived by Jordan, (5) page 109. 4 

In a second paper, we shall give a complete derivation of the joint distribu¬ 
tions necessary for the applications of the analysis of variance. A reconsidera¬ 
tion of the Schuster periodogram will be included. In other papers these 
results will be extended to problems arising in the theory of regression, and to 
problems of the distributions of medians, etc. 

The fundamental theorems of comparative analysis arc now obtained in such 
a form that they are applicable to problems in the theory of probability no 
matter what the distributions may be. Some special cases of these theorems 6 


1 Presented to the American Mathematical Society, March 27, 1037. Research under a 
grant-in-aid from the Carnegie Corporation of New York. 

1 Naturally theso techniques are also useful in other branches of science then those in 
which they were first applied, It should be noted that by analysis into components we 
here refer to the work of Fisher, (2), chapter 0. 

1 See, Poincard, (7), page 60, This theorem ib attributed to Poincarfi by Jordan, (5), 
andFrGchet, (3). 

* This distribution states the probability that in r trials of an experiment which has 
exactly n possible results, these results being mutually exclusive, each of the possible 
resultB occurs at least once. JortIan J s derivation has been simplified by Frfichet, (3), 
page 12. 

* The theorems are, of course, part of the theory of measure and integration. 
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have been used in connection with the derivation of distributions of positional 
statistics such ns the in order of N elements, 8 and others. 

Let Q be a collection of elements and let A be a set of subsets of ft. Then, 
the axioms which the elements of A are to satisfy are 7 

I. A is a field; 8 

II. ft i A; 

III. To every A e A there is ordered a non-negative real number P(A); 

IV. P(0) = 1; 

V. If A e A and B t A, and AB = 0, then P(A + B) = P(A) + P(B). 

We shall regard ft as tire set of possible results of an experiment e. By events 
we shall mean elements of A. The complement A of A with respect to ft will 
be an element of A if A is an element of A. A consists of all elements of ft 
which are not elements of A and hence is the event which occurs if and only 
if A does not occur. 8 

Let the subsets of ft 

( 1 ) Ei , E2, ■ ■ - , Eft 


be elements of A. Then, if oti, on , ■ * • , a* is a permutation of 1, 2, • ■ • , h, 
the set 

(2) E ai E ai ■ ■ ■ E af Eaj+i • ■ ■ E af . 


is an element of A and is the event which occurs whenever all the events 
E ai , E^ , " • , E ai occur, while none of the events E a)+l , E* i+i , . ■ ■ , E« k 
occur. 

The events (1) are said to be independent if and only if 


(3) 



for all selections of the sets (1) and their complements. 18 

Theorem 7. The probability that the first j of the h events (1) occur, while the 
remaining h — j events do not occur, is 


9 See, for example, Gumbel, (4). It is noted that Theorems I, II, and III are stated by 
Amo Fisher, (1), page 42, who assumes, however, that the events are independeiit. 

7 These axioms are Btated by Kolmogoroff, (0), page 2. 

9 A set of sets is a field if the fact that A and B arc elements of the set implies that 
A B, AB, and A — AB are also elements of the set. 

9 The event A will he said to have occurred if the result the performance of the experi¬ 
ment E is an element of A. 

10 See Kolmogoroff, (6), page f) for a discussion of various equivalent definitions of 
independence. 
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(4) ' ’' EiEi +i * ■ * El) = Zl (-l)* 1 2 P{E i ■ *' EjE ai • * ■ KX U 

* , "0 a i, 1 • ■, ct 

ll<“K >m <* 7 

Proof, Let fc = j + 1. Then it follows from Axiom V that 


(5) ■ ■ * Ei) '— P(i?i Ex ■ ■ * • • • EjEj. |_i). 

Hence the theorem is true for fc = j + 1 and any j > 0. Let the theorem be 
true for ft =* j, j + 1, ■ • ■ , fc — 1. From Axiom V it follows that 

(6) P(j?! ■. • Mm ■ ■ * A) 

= P(i?i • ■ ■ EjEj+i * * ’ — P(^i ■ • • EjEj±\ * * • Ek—iEk). 

Substituting from (4) the theorem is proved. 

Let n > 7i! + ■ ■ ■ + Tit, rii > 0 (i = 1, • • ■ , t ); and let 


n\ 

nil «st • ■ ■ ttjl (n — ni — * ■ ■ — n t ) I 

Corollary, If, for each value of r, (v = 
terms 


1, 2, ■ - - , fc - j), the (fc 



P{E\ • ■ • 2?,Z? a , ■ ■ * Eaf) 

which can be obtained by selecting a\, aj, ■ - ■ , a, without repetition from 
j + lj j + 2, * ■ • , fc, are all equal, then 

(7) P{E, ■ • • E,E W •••&) = 2 (-l)"(fc - j; v)P(Si • • • £,+»). 

v="Q 

Let 

(8) SW= E P®.,*!., •••£.,) 

**l»* * 

OL<*“<«» 


where the summation extends over the (fc; j>) terms 


(9) ■ • • Eaf) 

which can bo obtained by selecting v of the fc events (1) without repetition. 
If all the terms (9) which can be obtained by selecting v of the fc events (1) 
without repetition are equal, then 

(10) S(v) = (fc; r)P(Ei • ■ ■ EX 


11 By definition 

£ t-i)' E • ■ • *i*m ■ ■ • *■ J 

V-0 Of l," ’ ‘iff r™/"W 

«lC 1 '<i>> 


== P(Ei 


»()+£<-»' E P<A 

V~ 1 Oi. 1 ■ , .a r “j+ 1 

f*l <" 1 < CT r 


EjEcti ‘ 1 ‘ Ea r )> 
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Theorem II. The probability ihai exactly j of the k events (1) occur is 

(ID P« = £(-i)’(j + *;f)50+»). 

v -0 

Proof . If Av) ,is the subset of ft defined by the requirement that exactly j 
of the events (1) occur, then A is the sum of ( k; j) disjunct sets: 

t 

(12) Au) = 2 E ai ■ ■ ■ E al E ai+l • ■ ■ E ah , 

where a/ +1 , • • ■ , have those of the values 1, • ■ • , k which remain after the 
selection of cu, ■ • ■ , a,-. By Axiom V we may replace A by P in (12), Upon 
substituting from (4) we note that the resulting terms of (12) which depend on 
the same number v, v = j, • ■ ■ , Jfc, of events have the same sign, that all S(v), 
v = j t ... r k, occur, that no term depending on fewer than j events occurs, 
and that any particular P(E ai E ai • ■ - E aj+I ) will occur in those of the terms 
of (12) the j occurring events of which are a subset of E ai , E a2 , • • • , E aj+( 
and will occur in no other term of (12). Hence the coefficient of S(j + t) in 
(11) is (—I )* (j + t) J). This completes the proof of the theorem. 

Corollary, If (10) is true for v = j t ... , fc, then 

(13) P(,-) = £ (- l/fa; j, r) P{EiEi • - ■ Ef+X 

K -0 

Theorem III. The probability that at least j of the k events (1) occur is 

(14) P w = 2 (-l)'(i + » - 1; i >)8U + >). 

v =0 

Proof, If A {]) is the subset of ft defined by the requirement that at least j 
of the events (1) occur, then A 0) is the sum of k — j + 1 disjunct sets: 

(15) + A(,- + i) + • - ■ + . 

By Axiom V we may replace A by P in (15), Substituting from (11) . 

(16) P (i) = 2 e. S(j + ,), 

»=0 

where 

c * = (j + v\ j 4- r) - (j + v\ I) H-+ (— l) v (j + v) u), (v = 0, * • -, k - j). 

It is easy to prove that 

(17) (-I)'(J +»-1; *) = i (-1 r'(j + p-.j + !>). 

J <"0 

Corollary. If (10) is true for v - j, ■. • , k, then 

(18) P w = 2 (-l)'O' + p - 1 + 1.)P(E,& • ■ • E l+ ,). 
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To provide examples illustrating these theorems let us consider r experiments 

(19) E m , ... , E (r) 

Let E (<) have k mutually exclusive outcomes 

(20) oj’\ OS", • • • , ol®. 

Then, it is easy to define the spaces A ll> the probability function Pi{E ( ' y ), 
the combinatory product 

12 = fi (n x a (z) X •.. X 

the set A and the probability function P(E ) bo that Axioms I, • >* , V are satis¬ 
fied and hence Theorems I, II, and III are valid. 

We shall assume that the experiments (19) are independent. 

Let 

Oj (j — 1 } • ,k) 

be the event which occurs when neither Of ] nor Of } nor ■ • • nor 0,- r) occur* 
Then Of occurs if upon performance of the experiments (19) at least one of 
0} J) , ■ i 0) t) occur. 

It is an immediate result of the definition of independence that 

(2D p(< x, a,, ■ ■ ■ o.,) = n 11 - p(.oi V)- p(o“]) i. 




From Theorem I, the probability that Oi, 0 2 , • • • ,0,- each occur while not 
one of 0 J+ i, 0 J+ a, ■ , 0* occurs is 


m 


( 22 ) 


i ■ ■ • 0, 0, + 1 • ■ • 0*) “ ^ (—1) F 

p-0 m i.** '.a 


«l. 


n (1 - f’<o5Vi)-P(oi ;> ) - Pl.o"!) - P(0S,‘’)). 


From Theorem II, the probability that exactly j of O x t 0 2 , ... , 0* occur is 
(23) P< n = £<.- M* -j + v, ,)S(k - j + *), 


*-0 


where 

m-j + f) = 


E n {1 - P( 0 i?)-P(oS,‘, ’-(*,))• . 

a 1 < aa < ■ * '<a*-|f+r 

Since the probability that at least j of Oi, Oj (h occur is equal to 1 
minus the probability that at least h — j + 1 of Oi, 0*, ■ • ■ , 0* occur, 12 it 
follows at once from Theorem III that 


P{at least j of Oi, • * <, Ok occur) — 


(24) 


1 - £ (-l)’(fc - j + r,y)S(k - j + , + .1). 
►"0 


11 There are, of course, other ways of computing these probabilities, 
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The case treated by Fr4ch6t and Jordan is that which occurs when we assume 
P^) = P(0!°), (f = 1, • ■., k), {i, h - 1, • • ■ , r) and in (24) let j 1. 

It is not difficult to obtain further generalizations of Jordan's distribution, by 
defining events which occur if and only if fewer than f of r events occur and 
then proceeding as above. 

Certain useful generalizations of Theorems I, II, and III will now be derived. 
Let the subsets of ft 

(25) Eg*\ •«■, (s = 1, • • • , p) 

be elements of A, and let N = k w 4- & <2) 4* ' * ■ + k lp) . 

Let; (,) < fc (,) , (s = 1, ■■•,?); and let 

(2a) Q w = nw d=i, ■■■,?), 

■-1 i-l 




(t — l, • • ■, p), 


Furthermore, let for each value of s, (s — k, • ■ ■ , p), the (k^ — j (t) ; j/ ,J ) 
possible distinct selections of v (t) of the k l,) — j U) Bets 

(28) Pj(*5+i, • ■ ■, Pi*l) 


be arranged in some order, and, if the intersection of the v J) sets of the t s ‘ h 
selection be denoted by 


(29) 
let 

(30) 




(« — A, ■ • • , p), 
(i. = 1, 2, • ■ • , (k M - /*>; ■.“>)), 


I'*'" V, • •..» w ) = fl ?%“). 

i-Jl 


There are fl (fc f,) - j <f) ; y (,> ) sets (30), for each value of h, (h = 1, • ■. , p) 

t-h 

and any set of fixed values of • , j/ p \ 

Let for each value of s, (s =* h f ■. • , p) the y (0 ) possible distinct selec¬ 
tions of v (t) of the k (,) Bets 


(31) E\'\ (i = 1, *.. , fc w ), 

be arranged in some order, and if the intersection of the seta of the i, ih selection 
be denoted by 


(32) 4> w ) 
let 

(33) , ,>>) = f[ 
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There are (A;^; r w ) sets (33), for each value of h } (h = 1, ■ ■ ■ , p), and any 
set of fixed values of v 1 * 5 , • • ■ , 

It is clear that the various sets that have been defined are elements of A. 
The fact that the sets are the events which occur if and only if certain sets of 
events occur is also too obvious to require further comment. 

Theorem 17. The probability that of the N events (25) the first j {,] of super¬ 
script s occur and the remaining k (>) of superscript s do riot occur, s ~ 1, > • ■ , p, is 


(34) 


P(Q (p V p) ') = E E 

]|(1 ) bsQ 


• •• E (-1) 

y t P) pr 0 

(JfeU)-,•(!);,Up 




T 

(i-i 




< P -i 


Proof. Theorem I is a proof of Theorem IV for p = L The theorem may 
then be proved either by regarding it as a special case of Theorem I and col¬ 
lecting terms, or by induction. 

Corollary. If, for each possible set of values of v w , • • • , v (p) the 


fr ^ - /v 0 ) 


terms 


(36) 

are all equal, then 

W l 


P[g‘ l , n w )] 


p(QlplgW') = 


(36) 




(- 1 ) 


k (l 


y (j>M) 




a=L 


Let, (or each value of A, (h = 1, . • •, p), 

Sfy m , v M , • • •, /*) 

(37) 


(A-(A>;kW) 

- E ■ E p[q m q ( ' , ~ 1) '^ v ’ <i ' p (v ( a> 
£^1 


• /"’)]. 


ip^l 


It is apparent that by using (34) it is possible to obtain an expression for (37) 
which* does not depend explicitly on In fact 


. fc(h-n_v(A-D 

- V ^ 


S(v ih \ ■ • *, v lp) ) - E E < 

^oTy«0 ^ 


(-D 


p (1 )-|- . ■ »-J-y (A — L) 


(fc(D—,(l); k (l)) (jfe(A); k «)) (A(p);,.CPp 

£ ... £ £ ... £ 

<i-i U-i-i 4-1 <p-1 


P[ ? < ‘ -- , ‘-(v (1, | ..., , v'' 1 )]. 


(38) 


• • t 
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If the different terms of (37) are all equal, then 
(39) S(v a \ = ft 

t-h 


If the different terms of (38) are all equal, then 


s(p m , , w ) =. ' 2 ...' 2 (-i)' m+ - + ' <k "’ 

f O )■{} jrCfc—1 )■(] 

(40) ft (*'■ WV°) ft (*='V*>) 

P[q h " V°, • * •, v (ft ~ 1) ) V h \ • * • i V (p) )]. 

Theorem, V. The 'probability that of ike N events (25) the first j (,) of superscript 
a occur and the remaining fc (i) do not occur , (s *■ 1, ■ * *, h — 1), orwi exactly j w 
events of superscript a occur (a = /i, • • ■, p), is 




(«) 


o'*-”’) = 2 • ••2 (-D 

r(A)-8 rW-0 


ft (/•’ + »V“>«“> + • • •, f* + r«). 

I-A 


Proof. The theorem may be proved, either by induction using Theorem II, 
or by obtaining disjunct sets as in Theorem II and using Theorem IV. 

Corollary I. If (36) is true for all sets of possible values of v w , ■.. , v c,l) 
then 


(42) 




*(*)-,{*) 



fc(p>-y(p) 




(-i y 


(*)+... +F (p) 


ft Vc w ;/‘ > , v w )PIQ <IM, e < ‘- ,, Y'''V‘ , ) ’■ 

«-A 



Corollary II. If (40) is true for all sets of possible values of v m f • * • , p <r) 
then 


fcOUv(i) *0>>-y<p) 


P( l M.:m(Q fh - ,) Q' i '- ,) y= 2 2 (-i)- (,,+ - 

y(jO—o 




(43) 


ft (fc w -; w ;n w 

1-1 


) ft /") 




P[q' -V'\ 4 1 ->(,<m 

Theorem VI, The probability that of the N events (25) the first events of 
superscript s occur and the remaining fc (,) do not occur, s =s 1, . ■. } g — 1, exactly 
j events of superscript s occur (s = g t ,h - l), and at least j U) events of 
superscript s occur (a = A, ♦ < ■ , p) is 
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= t ... t (- 1 ) 

<(pM 


(44) 


/l-l 

n 


p 


>(p)+.. 1+p{p> 


n (/■’ + - f ") n U M +<-“ -1;/") 




S(i , ’ ) + v {, \ y ■, f r) + v'*). 

Proof. The theorem may be proved either by induction using Theorem III 
or by obtaining disjunct sets as in Theorem III and using Theorem V. 
Corollary I. If (39) is true for all sets of possible values of 

» 


So) „<fl+i> 

f V ) 


V m \v -, ■ • • J V' 


then 


= E ■■■ Z (-1 


( u )-0 


r<P>-0 


(45) 


ft v“) ft [(/■> + - 1; »“)(*“ ij" + r“)] 

J"fl 

p[Q M Q Wf.'.l {v U ,) f ... f 

Corollary II. If (40) is true for all seta of possible values of v {0 , t» w , • •« , v (p) 
then 

fc(l)_j(l) Jfc(p4-/<p) 

^ V ... V 

p(l)ia0 

(46) ft (fc W - j“> i r W ) ft (fc " 1 ;J w , *“’) ft Ki " 1 + r w - + » w )] 

I-A 

P[g 1 ' "V”, • • ■, r 1 "" 1 ’) g 1 "'V’, • • •, v w )]' 


£ ••• £ (-D 

»(D=o k < p 7 —□ 


v (1)+. *«>t*plp) 


• -L 


s—o 


Let us again consider the experiments (19), and let us assume that 
E £l) , (i = 1, ■ • • , r) has as its mutually exclusive results 

(47) Oil* (i = 1, • •• ,*“’);(« = 1,2). 

Let Of. be the event which occurs if, upon performance of the experiments 
(19) at least one of the events O^, 0 £ ?, ■ ■ ■ , 0*I ) occur, and let 6 ( , be the 
event which occurs if and only jf 0i» does not occur. 

We may state the probability that the event Ei , which occurs if and only if 
at least j (1) of the events O a , (t » 1, • •, h (1) ) occur, and the event E t , which 
occurs if and only if at least of the events 0« t (t = 1, ■ - • , k (3) ) occur, both 
occur. 

It is apparent that 

(48) P®ft) = 1 - P(A) - P(A) + PiEiEt), 

where Si is the event which occurs if and only if E , does not occur, (s = 1, 2), 
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From Theorem III 

p{E.) = E (-1 y l, ’(k M -1 + " w ;>'W’ - i +1''’' +1) 

(49) 

(*= 1 , 2 ), 

where 

s“ft" -}“’ +■ + u = E 

o l <' r ‘ <«Jfc( •)—j (i )+j>( • )+l 

nil- P(O l aii) — ■ • ' — P(0a^(,) .( ( > +(p (i) +l <)}j (® = lj 2). 

i-l 

From Theorem VI 

^ p(m) = 'e 1 n r -/■> +/■’; r w ) 

(50) ,( 1 M K (a )-0 «»i 

£(fc (1) - i (1> + * (l> + l, & (a) - j {2) + * <i!) + i), 

where 

s( fc <»_ j «> 4 ./» + i,fc <! ’-j® + /° + i)= E E 

M"1 1 

Ptf‘"(A 11 ’ - /“ + » 0> + 1, fc™ - f + <P + 1)], 
and 


?(«“ V - j“ + - 11 ’ + 1, k m - j w + «“> + 1)1 = 

II i- E p(o»?o - E noS), 

i-l ( r-l n-1 J 

the subscripts a r t (v = 1, . ■. , k (1) - + r (l) + 1), being those of the i 2 lh 

selection of k (i ] — j 0) + v (I) + 1 events from k (i) events, and the subscripts 
, (n = 1, ■ ■ ■ , k {t) — j® + + l)i being those of the fa lt selection of 

k w — j iV + v (i) 4-1 events from k {i) events. 

The desired probability is then obtained by substituting from (49) and (60) 
into (48). The procedure is perfectly general, and applies directly to situations 
in which p > 2> 

We shall now investigate the results obtained by requiring that the events 
considered satisfy a relation of implication. 

Let the subsets of SI 


(51) 

Eu , Eh , ’ 1 • , Ekt , 

(s — 1, • * < 

>P)> 

be elements of A, and let 




(52) 

Eu C Eh » 

(i ~ 1, • • ■ 

,*) 

if a < (. 
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It follows that 


(53) P(ft.ft;,) = P(Eu), (t = 1, ... ,fc), (s < (). 
Let j, < ji < • • • < jt and let 

(54) 


! Ji 


Let ji < jj < ... < j { and let 


Q, = II lift., 

1 


(55) 


e! = n n ft., 

*-l Wi+l 


(i — 1,2, • ■ • , p). 


(1 = 1,2, >--,p). 


(56) 


From (52) and (53), it follows that 

mgi) = .p(Tn it «.l 

\L*“i J 


T44 jVhi 1 * \ 

n n I. n ft,), 

_a-J i—j,+l Ji™/ i+l / 


(jo - 0) (1 — 1, 2, ■ ■ ■, p). 


Let ji < ja < ■ ■ ■ < jp and for each value of s, (s = 1, • - • , p), consider a 
selection of j, J- v t events of second subscript 5 from (51). Let the p selections 
thus obtained be such that 

3» + v, < in- 1 » (s - 1, 2, . ■ •, p), (jp+i = ft), 


and if E is one of the events of the selection of events of second subscript s 
then the fact that t > s implies that Eu is one of the events of the selection of 
events of second subscript t. 

From (52) and (53), the probability of the occurrence of all the events of the 
p selections thus obtained is a function of j p + v P events, h> of which are of 
second subscript s, (s = 1, ■ - ■ , p) where 

(57) m + /i 2 + ■ ■« + ii t = j, + v, f (s = 1, -«• , p), 

and for a given set of values of ji, , * * ■ , j p the p, and r, determine one another 

uniquely, (s = 1, ■ ■ ■ , p). 

For a definite set of values of ji, ■ * ■ , j p and m , • ■ • , gj* or ji, ■ * • , j p and 
v\, * • • , v p there will be 

(j*+i - Vt) = (Mi - j»; ja+i - Mi - • ■ • - M«), (s = 1, ■ • • j V )i (iiH-i = &) 

possible distinct selections of j» + v,, (s — 1, • • ■ , p) events of second sub¬ 
script a, j, of which are preassigned, from j t +i events, (s = 1, ■ • • , p). 

Let these selections be arranged in some order for each value of s, s — 1, • •., p, 
and let 

(58) 5i,i 2 ... ij,(mi > Hz > '' * ) Hp ) 

be the event which occurs when for all values of s, (s — 1, •• ■ , p), the events 
of the i a th selection of j, + v, events of second subscript a all occur. 18 

11 It ia understood that the j, predesigned events of second subscript s are among the j| 
predesigned events of second subscript t, (i > a) in the events (68), 
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A typical event (58) is 
(59) Qi.. i(ml> 



fa+l", 

n Bu, 


(jo + — 0), 


There will be, for a definite j a events of second subscript s, (s = 1, ■. ■ , p) 

(60) {l (ji+i ~ j#; v t ), ' (jjj-s-i »fc), 


events such as (58). 

For a definite set of values of mi , • • ’ , Up there will be, for each value of s, 

(s - 1, • ■ • , v) 

(k - fh-i - ■ ■ • - Mi J M *)> (s = 1, 2, * * ■, p) 

possible distinct selections of j, -f v, events of second subscript a, + y.-i 
of which are preassigned from k events, (s = 1, ■ ■ • , p). 

Let these selections be arranged in some order for each value of s, 

( s - 1 j • • ■ i p)j 

and let 

(61) , {h'ii» **• ip(Mi i Mz j 1 1 * j Mji) 

be the event which occurs if and only if, for all values of s the events of the 
set of j, + v, events of second subscript s all occur, (s = 1, • • - 1 p), and 
the first subscripts of the events of the L th set of events of second subscript s 
are among the first subscripts of the events of all the selections of events of 
second subscript greater than s, (s — 1, * ■ ■ , p), 

There will b6 

(62) (k; mi , Mz, • • * , Mp) ' 


events (61) which may thus be obtained. 

Theorem VII. The probability that of the pK events (51) the first j, events of 
second subscript s occur and the remaining k — j s events do not occur, s = 1, • • • } p, 
is 


(63) 


is~h k-jp 

P{QM = £ £ ... E (-i)” +r,+ - 

^j-0 yI H} 




Ut-dj^ri) Us-pijVj) (Hp> c ) 

A •*' "^(Mli M2 1 ■ 1 * i Mp)]» 

i|**l ilu-0 


where the event Q f determines the j, — j,_i — v,- x eventB of second subscript 
8, (s =* 1, *. • , p), which have as first subscripts all numbers 1, 2, • ■ • ,j, which 
are not among the j,_i -f p,_i numbers determined by the events of lower second 
subscript than s which are contained in 9^ ^ (mi , ■ ■ • , Mp). 

Proof, Expand (56) by means of Theorem IV. 
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Corollary. If, for each fixed set of values of pi , p 2 , • • ■ , Mp the terms (58), 
in number (60), are all equal, then 

. , me;) = e' r ■■■% <-ir—-na +1 -i. ; ,) 

(64) • , j" a •'p" 0 «—l 

H?i-..i(pi, M2, ■ « ■, Up)] 0‘p+i = k). 

Let 

(.h-jtl-.iii) {k-iii —''"Pr-LiPf) 

. . pi t '' •, pjd *= 2 Z) 

(65) o-i <i-i 1 

Mi j • • * j Mp)l" 

If all the terms of (65) are equal, then 

(66) T(fii t 4 •«, fi v ) = (k; m, ii 2 , « • ■, p P )P[qi...\(jn l ■ ■ * i Mp)]- 

\ 

Theorem VIII. The probability that of the pK emits (51) exactly j, events of, 
second subscript s, s — 1, - • - , p occur , is 

>'l"j'l j'3-^jj h-U 1 

Pe,E E ••• S (-i)’ ,+ ' ,+ - + '' 


(67) 


Ki”0 


ll (m.I j. - Ml - * ■ ■ - Mj-i) T’CmIj M2» ' * ■ , Mp)* 


Proo/ ( If Aa l , is the subset of 0 determined by the requirement 

that exactly j. of the events (51) occur (s = 1, . ■ • , p), then , /,) is the 

sum of 

(^i j\ i Ja — Ji i Ja Js i - * • , jp — jp-i) 

disjunct sets which may be obtained by replacing P by A in (56) and forming 
(56) for all selections of j a — j,~i occurring events from k - j,-i events, 
(s = 1, • - - , p). By Axipm V, P< Jl( ..., y p ) is the sum of the probabilities of 
these disjunct sets. 

Substituting from (63), it is noted that all terms (61) which depend on the 
same p,, (s = 1, • • ■ , p), have the same sign and that all T{p i, m , • • ■ p P ) 
for which 

0 ^ V* ^ Ji+1 js , lj ■ 1 ' ) P); 

appear and only those appear. Furthermore any particular term (61) will 
occur in those of the terms (63) the j , — j t -i occurring events of second sub- 
script s, (s = 1, ■«■ , p), of which contain a fixed v t -\ events, the remaining 
j t — j t _i — v,-i events being a subset of the p t events of second subscript s, 
(s = 1, ... , p), that actually appear in the particular term (63). Hence the 
coefficient of T(p i, - • ■, pf) is 

(_!)., w - 


(po = 0). 
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ConoLLABY. If ( 66 ) is true for all sets of possible values of /h , jus, • • ■ , /i p 
then 

p«. iP) ='£' , S , ...‘f (-D~ 

*i “0 v p "-D 

(£>Q\ 

(fcj ji} vif i* ~ii — v ij p 1 *>ip — ip-i" > ***) 

M 2 , - • • ,Mp)J. 

Theorem IX. The probability that of the ph events (51) at least j a , but not more 
than j g +.\, events of second subscript $ occur, (s — 1 , • ■ • , g), and exactly j, events 
of second subscript s occur, (s — ff +■ 1 , ■ • ■ , p) is 

(69) = E X! 2 ‘ V i®g)i 

6 g“Q ^“0 

where, if a 1 in the i th position is denoted by 5,-, (i = 2, ■ * ■ , 17 ), 

■Sc/p+i.--•.;?)(!, Si, ,S yit 0 t ■ * ’, 0, fi T2 +L, • • •, fiti, 0, ■ ■ ■, 0, ■ • •, fi yj ,+i, ■ • • ,S B ) 

-E-- E E ••• E E •••E (-ir i ^ ,+ - + '» 

fp-° I'p-M-O t'f-O ► > l”° 

fr°) 0‘i + pi - i; pi) • • • (i v , + - jyi-i - - 1 ; p Y ,) 

(Ju + Vy t - hi ~ Vyt - 1; Py ( ) ‘ • ■ Op + v p - - v p -i) v P ) 

T{ji + n, ■ • • ,Jti + Py, ” hs-i ~ Py*-i, 9, ■ • •, 0, 
hi + Py* — jyt " Py 4 , ’ ’ 1 , jp + Pp i?-l “ »V-i)- 
Proof, We note first that there are 2 f ^ tenna in (69). Since 

(71) PftA-t ■■■ E £ P*... 




Jp+i-"ip)5 


the theorem may be proved by a process of repeated summation. From (67) 
and (71) 

*2 Xl~*l k j-M Wo 

pH;) .*, 1,+j-w = E E- E ••• E (~-i)" + ” + - + > 

M"/i n** 0 >j-o K_p-o 

(72) 

(Xi + J>ij M)(Xa + ^2 ~ Xi -- Vi] p 2 ) • ■ ■ (jp -f v v — — i/p^; p p ) 

^(Xi + 1 * 1 , X a + Vi - Xi — u if '' ‘ i jp v s> Jp-i — Vp_0« 
For fated values of Xj, X 3 , ■ ■ • , \ there will occur jn (72) all terms 

(73) T{ji + ft, X 4 + v% - ji - ft , * ■ * , j v + v p - j p _! - vp-i), 

( 0 i = 0 , • • • f Xa — jj), (0 < p. < X 4+ i — X,), (s = 2 , ■., , p), 

(Xp^* “ 5 = 1 > • ■ • , p — fil), 

and any definite term (73) will occur in all 

“ii ip} 



THEORY OF COMPARATIVE STATISTICAL ANALYSIS 


173 


for which 

0 < a < ft. 

In (74), the definite term (73) will have coefficient 

(-1)*-*-*+ •” + '*(ft + ft ; ft + a)(X. + * ~ ft - ft J *) 

(75) • • • (ip + v p - ft,i - f» P -i, J'p), (a = 0, 1, ■ ■ ■ , ft), 

(ft = o, ■ *', Xs — it). 

Hence, in (72) the definite term (73), will have coefficient 
(_i) 0 ,+„+... *,y, + ft _ i; a)(x 2 + V, - - ft;«) 

• • ■ (Ji + “r ~ Jr-i - ■'t-i i >p)i 
and 

(76) •■ ■>„)(!)- 
We now evaluate 


x» 


(77) 


p(i’i /i) _ p(A} 

t (X ,~ Is 


X 2 -J 1 


(78) 


For any fixed values of X 3 , ■ • ■ , X,,, there will occur in (77) all terms 

T(ji -j- ft , ji + ft — ft — ft > Xa + — ji — ft ] 

■ * ■ 1 ft + Vp — ft-i “ ^P-Oi 

for which either 0 < ft < Xs — ft ) 0 < ft < J 2 — ft — 1 or ft = ft — ft + 7, 

0 < 7 < X3 — J2 ; 0 < ft < X3 ~ ja — 7 - 

Let 0 < ft < ji - ji - I; 0 < ft < Xs - ft. Then the term (78) will occur 

in all 
(79) 

such that 

0 < a < ft. 

In (79), (78) will have coefficient 

(_ 1 )H,+ft-.+.i+...+'»Q 1 + _ 1 ; (3,)y, + ft - ,1 - ft - 1; ft - a) 

(ta + ft — ia "■ ft J ft) (jp + Vp jp-i ~ v p-i } v p)‘ 

Hence in (77), (78) will have coefficient 

(_ 1 )^*+M+...+P P (ji + ft _ 1; ft)(ft + ft - ft - ft - 1; ft) 

(Xa + ^3 — ft — ft J J's) • • * (ft + Vp — jp -1 ~ ^p-i > v p)> l 

(ft = 0 ,. * • ,ft— ft — 1), (ft — o p • • • , Xj - ft), 

(v, = 0, • ■ • , X*+i — Xg), (<s = 3, ■ • ■ , p)j 

(X„+. = jou), (s - 1, * ■ ■ > V - fl) 


(80) 


( 81 ) 
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Now let ft = j% — ji + v t Q < v < h — jtQ < fa < h — ji — 7 . Then the 
term (78) will occur in all terms (79) such that 

7 < or < ft, 


and in ( 79 ), (78) will have coefficient (80). Summing for a, (a = 7 , . *. , ft), 
we obtain as the coefficient of (78) in (77) 


and 


Hence 


0 , 


if ft > 7, 


(— l) ( ‘ +, . + - + '"0'i + A - 1; A)(Xi +- ji - A • k,) 

■ + # ( Jp H~ Vp — Pp—i t ^j>)j if ft = t« 


(82) 


1 ) + 0 ). 


If we examine (82), we note that the result of summing with respect to X* 
has been the replacement of (76) by two sums which are similar to (76) in that 
the next summation index, in this case X 3 , occurs in exactly two limits of sum¬ 
mation. If it can be shown that, the two sums which occur in (82) each result 
in a pair of sums after summation with respect to \ 3 , or more exactly if 


>i+i 


ftj • • ■ f ft) 

(83) *.+i-fr+i 


— ft) • • • j ft: 1 ) + ft) 1 ’ ■ ■ j ft 10 ) 


then the proof will be completedr 

Since the truth of (83) may be demonstrated in exactly the same way in 
which (82) has been shown to be true, the theorem is proved. 

Corollary. If ( 66 ) is true for all sets of possible values of ai, £ 2 , ■.. , p p 
then 


1.—.f P )(l) ft» * ftijO, ■ ■’, 0 , S yi+ i t • ■ - A,, 0 , • ■ * 0 , • ■ * * ■ • ,ft) 

k—'ip 1 1 jp+i“"?(r 1 

.-E- E E ••• E E (-1 

, p md 0 *'<)+• 1 ® t yrhr i ’n 


r ,-0 


0*1 + Vi~ ljj'i) ‘ ■ 0 ‘yi + hi - hs-l - Vy,-1 - 1; Pyi) 

(84) OVi Hh v 1{ - - v yi - 1; v u ) • * • (j, + h ~ jp-: 1 “ » P -i J v p ) 

(k \ji + vi, ■ ■ ■ t j yi + p yi - j T| _ t - 

+ f'n - hi ” v yi , • ■ • ,jp + Vp - jp -1 — Vp-i) 
« • 

-Pttfi-.-iO'i + vi, * ■ •, j yi + - j 7 ,_ 1 - ^,-i, 0, - ■ ■ , 0, 

hi + v y< - j yt - v u i • • ■ , j p + y p - j - j) p _i)]. 
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Let us again consider the experiments (19) and let E {i} have as possible results 
Oj t (j = lj ■ ■ • j ^)i ($ ~ 1, 2) (i =* 1, 2, * • • , r). 


Let 

/)(<) -v n to (i = 1, ■ ■ ■ , r), 

°" " o = i, 

i.e. Ojj* occurs whenever occurs. Furthermore let the outcomes 


off, aii 0 , • • •, oi? 

be mutually exclusive. 

Let 

, <5*, 

occur if and only if none of 

0}?, Off,... .off 


(« = 1, 2), 


occur. 

We may wish to know the probability that at least ji of On, • > ■ , On and 
at least ji, > ji, of (5u, , ■ ■ ■ , Ow occur. 

From Theorem IX this probability is equal to 

(85) - *(1, 1) + m, o), 


where 


BO, i) = ‘f‘ i, £' 1 (-l)’' + '’(i 1 + n-i;v 1 ) 

V2— o Pi—0 

(j 2 4- vi - ji - n - 1; v*)T{ji 4- Pi,ji + va - ji — vi)i 

and 

B(l, 0) = + n - 1| Ki)r(j'i + i>i). 

I’rJr/i 

From (63) 

(86) T(ji 4" vi, jz H - ^2 — *7i ^i) *= X) ]C 

O-i < 1-1 

P[& 1(1 0\ 4- Vi J ji + V2 — jl — FOl, 


where j from (61) 


fi+n _ Zi+f 1 ! 

4u^0i 4- j*2 H- — j\—vi) = n n » 

f—j'i+Pi+I 


the subscripts 

(87) , «2, • • ■ , a/,+r, 
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being the first subscripts of the ii th selection of ji + n events of second sub¬ 
script 1 from 

On, On i • ■' > Ojti, 

and the subscripts 


being the first subscripts of the ^selection of j 2 -f v 2 events of second subscript 2, 
ji + vi of which are (87), from 

Ol2 , Om , • ‘ * i 0*2 . 


It is easy to see that 

PfejO’l + VI, ji + v 2 - j\ 


r ( i’l+i'i 

».)]=n i - e p{of, ■) 

l r»i 


S p(0 


Furthermore 

(88) r(ji + mi) « £ + vdl 

{l-l 

where 

r ( i\+n ) 

p[fo(j. + »i)l = n a - I 

I 

Substituting from (86) and (88) into (85) the desired probability is obtained. 
It may be remarked that theorems which have the same relation to Theorems 
VII, VIII, and IX that Theorems IV, V, and VI have to Theorems I, II, and 
III may be obtained without much difficulty. \ 
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REPLY TO MR. WERTHEIMER’S PAPER 

Richmond T. Zocii 


The attainment of rigor both in applied as well ns pure mathematics is a slow 
process, and for this reason criticism of my paper, if constructive, is welcomed. 

Properties like continuity, differentiability, and dimensionality are local 
properties, that is to say a function may be continuous or differentiable over a 
certain range but not outside this range, or otherwise a function may be con¬ 
tinuous or differentiable over a given range except for singular points. 

The presence of singularities in functions does not necessarily cancel their 
utility. Thus the function y ^ tan x contains poiuts where it is discontinuous, 
but ordinarily it is regarded as a continuous function and the presence of these 
singular points seldom handicaps one when working with this function. Simi¬ 


larly, the function / = x - i — is a function which satisfies all four Axioms as 

^2 

stated in Whittaker and Robinson’s book and expresses the mode of Pearson's 
Type III curve as a symmetric function of the measures. The fact that this 
function is not differentiable along the lino = Xs = % = • • • - x n will never 
handicap the investigator for unless the frequency distribution is clearly skew 
the Type III curve would not be used to represent it. 

It seems that Mr. Wertheimer bases nearly all his criticisms on the tacit 
addition of the word “everywhere" to Axiom IY as stated in Whittaker and 
Robinson’s book. The word “everywhere” is not in the statement of Axiom 
IV and I assumed nothing else than stated in the axiom. 

If one deliberately adds the word “everywhere” to Axiom IV then nearly all 
my criticisms of previous writers are incorrect, unfair, and unjust. However, 
it does not seem that clearness and rigor in mathematics are increased by read¬ 
ing into an axiom a word that is not there. 

Consider first the criticism in my paper which remains valid even when the 
word “everywhere” is added. (Schimmack uses the word “everywhere” on 
page 127 although Whittaker and Robinson do not.) Both Schimmack and 
Whittaker and Robinson proceed as at the top of page 217 of the book by the 
latter authors with the statement: “In this equation make fc —* 0 then each 


of the quantities 




tends to a value which is independent of the %'s 


)} 


" a/ 

T'his statement rests on the tacit assumption that the qu anti ties -*■ are func- 

L9® n . 


tions of k. Even if such were true the use of tacit assumptions in a rigorous 
proof is objectionable, bat as a matter of fact these quantities are not functions 
of k. Thus the particular proof given in Whittaker and Robinson’s book as 
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well as in Schimmack's paper is altogether lacking in rigor even when the word 
■“everywhere” is added to Axiom IV, Both Schiaparelli's and Broggi's proofs 
.appear to be entirely rigorous if the word “everywhere” is added to Axiom IV. 

In preparing my paper I assumed that no prohibition on functions which had 
■singular points Was contained in Axiom IV. In other words, I assumed since 
the word “everywhere” did not appear there was no valid objection to intro¬ 
duce and discuss functions with singularities. The functions I introduced are 
everywhere continuous but they are not differentiable along the line in Euclidian 
n-space defined by xi = a's = Xt = • ■ ■ - x n . They are differentiable at every 
■other point in the space. 

It seems to me since Axiom IV as stated in Whittaker and Robinson's book 
does not exclude functions which are not everywhere differentiable that all my 
criticism is fair and just, and moreover nearly all my statements are correct. 
Mr. Wertheimer is entirely correct in pointing out that the words “everywhere 1 ' 
on page 181 of my paper are contradictory. As a matter of fact the whole, 
paragraph beginning with line 7 on page 181 appears to me, on reexamining it, 
to be unsatisfactory. Except for this single paragraph I believe my paper to 
be rigorous, but I welcome further criticism. 

Mr. Wertheimer’s conclusions in his paragraph number 4 are clearly errone¬ 
ous. To show this, consider a function of h, As k —> 0 any one of three situa¬ 
tions may arise, namely: (1) The function may become infinite, (2) the func¬ 
tion may become indeterminate, that is it may take on any value whatever, 
(3) the function may approach a unique finite value independent of ft. Neither 
Scbimmack nor Whittaker and Robinson nor Mr, Wertheimer has established 
. as a definite fact that the particular type of function here in question approaches 
a unique finite value independent of A: as ft—> 0. The truth of the matter is that 
this conclusion cannot be established because the function in question does not 
involve k either explicitly or implicitly. 

In conclusion there are two things I wish to emphasize, First, even when 
the word “everywhere” is added to Axiom IV, the proof given in Whittaker 
and Robinson's book is faulty, but if one consults the references given there 
in the footnotes he will find two other proofs which are rigorous with this ad¬ 
dition to Axiom IV. Second, the mode of a skew bell shaped Pearson Fre¬ 
quency Curve satisfies all four axioms as stated in Whittaker and Robinson’s 
book, and the fact that these expressions for the mode are not differentiable 
along a certain line is never a handicap to the statistician. 


George Washington University. 
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CORRELATION SURFACES OF TWO OR MORE INDICES WHEN THE 
COMPONENTS OF THE INDICES ARE NORMALLY DISTRIBUTED 


By George A, Baker 


Indices arc widely used in statistical analyses. 1 In many cases incorrect 
conclusions are drawn because indices are not uncorrelated or independent even 
though all of the component variables are independent. In a previous paper 2 
the distribution of an index both of whose components follow the normal law was 
given exactly he. without approximation. The purpose of the present paper is 
to give the simultaneous distribution of two or more indices when each of the 
components follow the normal law. The case for two indices will be discussed 
in detail and the extension to more indices will be indicated, 

Let Xi, % j and £a, be correlated variables each being normally distributed 
about their respective means , rth , mz > with standard deviations ^ , <r a , ft > 
and let the correlations between the variables in pairs be represented by ria, 
r i3 , r i3 . Then the simultaneous distribution of these three variables will be 


1 

(2ir)iEVi<TaV3 6 ^ 


1 11" Ru{xi - mif R *afcci - rmf ' R 33 fa - vuf 

on 4 I 2 2 

* H-L <Tl 72 7 s 


(1) 


+ 2fln ~ - mi ^ X2 ~~ 


^ 2]j 13 fa - m i){%3 - m) 

(Tjffg 


■f* %Ri3 


(22 - m^fei - Wj) 


ffsffa 


dxidxidx j 


where 


R = 


1 r» r 13 

1'is 1 ri 3 

r ]3 r« 1 


and Rij are the respective second order minors of R. 


1 Rietz, H. L, “On the Frequency Distribution of Certain Ratios,” Annals of Mathe¬ 
matical Statistics, Vol. VII, No. 3, Sept. 1936, pp. 146-163, 

2 Baker, G, A., “Distribution of the Means Divided by the Standard Deviations of 
Samples From Ffon-homogcneous Populations,” Annals of Mathematical Statistics, Feb. 
1932, pp. 3-6. 
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If we make the transformation 


, 331 

2 l = —, 

3s 

X\ 1 = Sl 23 

Zi — —, 

ay 

X 2 - 22 S 3 

2 a = £ 3 , 

Xi = 2 a 

dxidxtdxs = 

Sa dZ{ dz 2 dz 5 


which is certainly valid if Xi , x 2 , x 3 , are all positive, then ( 1 ) becomes 

11^ fln(glg3 — tftl) 2 _|_ -^22 (22^ — fflQ 2 


e ^P* n D 


2«L' 


2 

ffl 


02 


(2sr)*K^ cri<r 2 erg 

^ + Baafe - m? ^ 2Rk (2123 - mdfazz - ffla) + ^ (gi 2 a - - m 3 ) 


-2 

*3 


(Tl(T2 <7 i«T3 

(2223 - ^)(za - Wlj) 


“J" 2^23 ' 


0 , 2CTa 


- 23 dzi 


dz% cfoj > 


H Xi , % 2 , arc all positive the corresponding distribution of Zi and 22 can be 
obtained by integrating ( 2 ) between the limits 0 and «> with respect to 2 a. 

If 3 i, irj, xz are all negative Zi and 22 are again both positive so that in order to 
get the total distribution for Zi and Z 2 it is necessary to add to the integral of ( 2 ) 1 
between the limits 0 and ® with respect to s a the similar integral of ( 2 ) with 23 
replaced by — z 3 . The result is 

& -1 

\Ar B* b 2 [ Vr -j S 3 , , R}b 2 \/tt 
-V —r —~ 1 e dz + —r *-1= 
s/2 a* a 7® 0 y 2 - 

/i ® 11 -® 1 ^22 2 Baa - 2 B w , 2 i?ia . 2B?a 

a = — Zi+-^2it—j-i - 2 i 2 a t- 21 H- 22 

0 " l 03 CTi ^2 O'LO'a 0 ' 20 ’s 


(3) 

where 


_n 

2e " 2 V sa 




CT 1^20*3 


b = 


■Bn ^ 1 Bk , , J2o3 , 

— Wtl2l H- 2 ' ^2^2 — ^3 + 

01 (Tg 0g 


Bn 1 BiJ 1 Bis 

-T" - Wl\2i + - 171 }2i 

(Tfl <X|, <T2 01 0J 


+ mi + 


B 23 , Bj3 

- m 3 Z 2 -b ■—~ fth 


0]03 


0203 


020* 


2 I B22_2 , Bm ...2' , 2 Ui2. , 2Bl3 ___ , 2B23 


Bll B I -WM M I -*^00 Z I “iWO , * 

e — — Wi H—j w-2 H—g- H-mi m2 4* r 

01 02 03 0102 0103 


mi m 3 H-mams. 

02 0"3 


The, same result (3) is obtained for Z \, and 22 negative, Z\ positive and 23 
negative, Zi negative and 22 positive. That is (3) is the simultaneous distribution 
of ?i and 22 . The extension to more than 2 indices is immediate. The form of 
the distribution of the indices and the denominator variable is the same as (2) 
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except that a, b, and c, the coefficients of z\, z 3 and the constant term respectively 
in the exponent of e, will be different in that they will include the new indices and 
the exponent on the denominator variable will be the same as the number of 
indices involved. The distribution of the indices will again be obtained by 
integrating from 0 to « with respect to the denominator variable. 

The case when all of the variables %i , , £3 are independent is especially 
interesting. If 7 * 12 , ?' i3 , are all zero then R = R n = f£ 22 = ff S3 = 1, J2 W — 
#13 = #23 = 0 and a, b, c, become a\ b’, c', respectively, 


.2 J 1 

*=%+%+\ 

<J2 CTg 


% 

*1 


b' = 


niiZi . mz 2 . m» 

2 ' 2 t a 

Q\ CT 2 0"3 


2 22 

, mi , ms 

® “ 2 T ~2 2 

CT 1 0*2 O '3 


Under these conditions and the further condition that , m 2 , mi are laTge with 
respect to vi, a* , respectively so that the integral term of (3) maybe neglected 
(3) becomes 


/mm fti 1 \ 2 



It is clear that Zi and are not independent in the probability sense for dis¬ 
tribution (4). 

The question as to the possibility of having the variables independent and the 
indices independent at the same time arises. Denote the distribution functions 
of Xj , £ 2 , £ 3 , by Xi(xj), Xsfa), Xs(xi) and of Zi, & by Zi( 2 i), Za(za). Then, if 
xt > 0, i — 1,2,3 it is necessary that 

( 6 ) ( Xi(ziZi)X2{ziZi)Xs(zs)zl dzs = Ziiz^Ziizs) 


a and b being suitable limits. 

For instance, let 

X,(x,) = i, 

Xfad = 4. 

Xi 

Xfcl) - x\ } 


1 < & < 3 


1 < xi < 3 


1 < 13 < 2 



182 


GEORGE A. BAKER 


then 


Ui i) = 

zi 

Z^Zz) = 

2a 

for value of Zi and z 2 within a straight line aided area the corners of which are 
[h i)i (h 1), (1,1) and (1,2). z i} and z 2 are not uncorrelated throughout their 
entire set of values but are for this particular set of values. Thus is appears 
that it is possible that the indices may be independent when the variables are, 
but not necessarily so. 

Indices should be used with care since it is very easy to draw invalid conclu¬ 
sions from the consideration of them, Usually it is better to use partial corre- 
. lation analysis to remove the influence of a third factor than to calculate indices. 



THE TYPE B GRAM-CHARLIER SERIES 

By Leo A. Aroian 


While much attention has been devoted to the Type A Gram-Charlier series 
for the graduation of frequency curves, the Type B series has been somewhat 
neglected. However the numerical examples to be presented later will show 
that the Type B series is very useful for the graduation of skew frequency 
curves. Wicksell 1 hQS demonstrated that the Gram-Charlier aeries may be 
developed from the same law of probability which forma the basis of the Pearson 
system of frequency curves. Rietz 2 following Wicksell gives a derivation of the 
Gram-Charlier series based on the binomial (q -f* vY- Jordan 3 gives a method 
for fitting Type B based on certain orthogonal polynomials which he calls G. 
He uses factorial moments because of the resulting ease in finding the values' 
of the constants. 

We shall consider the Type B series for a distribution of equally distanced 
ordinates at non-negative values of x. We shall find the values of the first few 
terms of the series and shall also show how the values of later coefficients may 
easily be found. We write the Type B series in the form 

(1) F(x) = Co + CiAi^(x) -j- Cz/Y\p(x) -f Ca&?\p(x) + aY(x) + CsA 6 f(a;) + CaAfy(x) 
where 


( 2 ) 


<K&) = 


-m ...t 

e ni 


xl 


m = ni, the mean, 


A^(ie) = \f/(x) - - 1) for x 0,1,2, ■ ■ ■ s, 


Let f{x) give the ordinates of the observed distribution of relative frequencies, 
SO that 2 f{x) = 1 . To determine the coefficients c 0 , ci , c 2 , ■ ■ * , c B , we have, 
using the method of moments, 

2[co^(a0 + Ci&ip{z) + caAVW "1" c aAV(*) +.+ c*AV(®)J * S/(#) « 1. 

2x[c 0 ^(a:) + CiA^(x) +.■.+ CflAV(s)] = 2 xf(x) = m, 

2fc 2 [cQ^(x) + CiAi/^x) +.+ Afy(x)] — ^xj{x) = p2 • 

i 

(3) SfcWto +. + CeAV(s)] - *x%) = pi, 

2x\c^(x) +.+ CfiA^(a;)] = Xx 4 /(x) = pi - 

Z*W(s) -T.“b CfiA ^{x)] ~ 2x/(x) — pj- 

2z*[ctfp(x) +.+ c*AV(«)I = 2s 6 /(x) « pi¬ 


rn 
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Hence we must find the values of 



E 


n = 0,1,2,3 • ■ * 
p = 0 , 1 , 2 , 3 • ■ ■ 


defining Afy(s) = ^(a;), We assume that we are dealing with distributions 

eo 

in which s is large, and that the error involved in substituting 2 x n & v ip(x) for 




2 %*A V '!'{$) is negligible. To find these summations in a straightforward 


**»& 


manner would involve too much labor, so we shall briefly discuss some properties 


-m s 

e 771 


of the generating function, \}/(x) = —:—, the Poisson, exponential, very useful 

vC [ 

in the graduation of frequency distributions of rare events. The first eight 
moments about the origin are: 


go = 1 =^2ip(x), iii - m as Xx^(x), ju 2 = 7ti -f- m 2 * 2xfy(:r) 

= m + 3m 2 -f Vi — 2xV(») 
n\ = 7?i + 7m 2 + 6m 3 + in => 2®V(®) 

(6) aJ = m 16 rri + 25m 3 + 10m 8 + m B =s Exfy(:t) 

»' B = m + 31m 2 *f 90m 3 + 66m* + 15?/? + m B = Ssfyfc) 

= m + 63m 2 + 301m 3 + 350 m* + 140m B + 21m 6 + m 7 = 2x>(s) 

/j' e = m + 127m® + 966m s + 1701m* + 1060m 6 + 256m 6 4- 28m T 4- m B 

= Sxfyfc) 

These may be found by the formula given by Jordan, 3 


( 6 ) 

Proof: 


„: +1 - »(* + »)■ 

#(s) „ #(g) _ 

dm m 


We multiply by s” and sum, giving (6). This result may readily be proved also 
by means of recursion formulas without differentiation. Now we must find the 
values of 


E 


0 


X n A v \p(x) 


We do this by proving 


n - 0,1,2, ■ • • 
V - li % 3> • * • 


(7) 




E *" 4 ,+ V(*> = -/ E *" 4 *\K«). 

f“0 pn[) 


£■100 
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Now 

(8) x - 1) - i P(x) = -A\p{x). 

Hence 

+ (‘)f(x - 2) + ... + (-l)V(x - a)l, 

since A*\p(x) = ^(x) — (j)*(x - O + ( 2 )^* - 2) + • • • + (-l)V(x - «)• 
Then by (8) 

^AV(x) = jj/Kx - 1) - \Kx) - f*^(x - 2) + f^lKx - 1) 

+ “ ( 2 )^* “ ^ + " ’ + (~!)V(x -s-1) 

- (-l)V(x - *)J. 

(9) ^AV(x) = -*(*) + (“ j X )<Kx - 1) - (' £ X )lK» - 2) + • • • 

- (~m(x - s - 1). 

= - [ M - (® | #(* “ D + (' t 0 * (l - 2 ) + •' • 

+ (-l)V(x - s -» 1)1. 

= —A ,+ V(x). 

We multiply (9) by x n f sum with respect to x, giving (7)'. 

Thus by use of (7) and (6) we get: 

2A*V(x) = 0, p = 1, 2, 3, ... 

SxA^-(x) = = -1. 

(10) = -^ (m + m!) “ - 2m - L 

2x 3 A^(x) = —3m 2 — 6m — 1. 

2x 4 A^(x) = — 4m a — 18m 2 - 14m — 1. 

Sx e A^(®) — —5m 1 — 40m 3 — 75m 2 — 30m — 1. 
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2x 6 Af(x) = -6m 5 - 75m 4 - 260m 3 - 270m 2 .- 62m - 1. 

Z*AV(») = 0, 2z 2 aV(z) = 2, XxW^ix) = 6m + 6. 

2xWt(x) = 12m 2 + 36m + 14. 

2zAV(z) = 20m 3 + 120m 2 + 150m + 30. 

XxWipix) = 30m 4 + 300m 3 + 780 m 2 + 540m + 62. 

. 2*AV(*) = 0, XxWHx) = 0, ZxWfix) = -6. 

2)AV(i) = -24m - 36, 2xA^(x) = -60m 2 - 240m - 150. 

2xW\j/(x) = —120m 3 — 900m 2 — 1560m — 540. 

(10) 2zAV(x) = 0, Sa; 2 AV(x) = 0, Zx^ix) = 24. 

2®‘aV (») = 120m + 240, 2* a AV(a:) = 0. 

2*‘AV(*) = 360m 2 + 1800m + 1560. 

2xA l i(x) = 0 , 2x&*t(x) = 0 ., 

ZxWiix) = 0, 2.-e 2 aV(z) = o. 

2z 3 Afy(:E) = 0, 2x a A V(*) = 0. 

2z 4 Afy(z) = 0, 2z AV(z) = 0. 

2xA 6 i(x) = -120, 2x®Afy(x) = 0. 

2xWip(x) = -720m - 1800, 2xW^(x) = 720. 

Finally we substitute from (5) and (10) into (3), and for we substitute 
Mn = jc ^ Hn-rm. Hence 
c 0 = 1 

Ci = 0 

Ca s £ (M2 - W). 

(11) c a as — ~ (ms “ 3/x 2 + 2m). 

C4 = — ^M3 + M2(ll — 6m) + 3m(m — 2)]. 

c 5 = —[mb — 10jU4 — ju3(10?n — 25) + 50/U2 (m — 1) — 4m(5m — 0)]. 

c, = 1 [m - 15^5 + *u(85 - 15m) + /ia(130m - 225) + w (45m 2 - 375m 

+ 274) - 15m 3 + 130m 2 - 120m]. 

It may be asked whether criteria may be given as guides for the use of Type B. 
In general Type B may be tried if either the skewness of the distribution to be 
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fitted is considerable, «g = -j > .6, or if m = /x 2 = ms approximately. The 

M2 

latter condition strictly would mean that alone is sufficient for a good 
graduation, if the fourth moment, im , is not used. The examples which follow 
are arranged to facilitate comparison with the Pearson system of frequency 
curves. We have an example each of Type I, III, IV, V, VI, and an example of 
the normal curve. 

Type I. Table 1. Here > .6 although m ^ /12 ^ ^3 ■ The first four 
moments, unadjusted, give an excellent fit by Type B, which is not quite as good 
as Type I. The degrees of freedom, according to Fisher, 4 have been taken into 
consideration here in applying the x 2 test. The two classes 13, 14, were grouped 
together for the x test. The actual numerical work is easily done on a cal¬ 
culating machine, although logarithms are necessary to find the value of e~ m . 
This example and the remaining are all taken from Elderton 6 with the exception 
of Type IV which is from A. Fisher. 6 

Type III. Table 2. The unadjusted moments are used. Here a 3 = 2,0833 
> .6, and m ~ y 2 approximately. The fit by Type B is slightly better than that 
by Type III. • We have for Type III P(x 2 > 12.8) = .007, n = 3, while for Type 
B, P(x 2 ) > 9.4 = .025 n = 3. Moreover the standard error of prediction for 
Type III is 11.2 and for Type B is 7.7. 

Type IV. Table 3. The rough moments were used. Although a 3 = .48 < .6, 
Type B gives a fine fit since ra = ai 2 = m 3 approximately. Here the results are 
given for Type B using 2, 3, and 4 terms of the series. This was done to show 
how the distribution changes with the addition of more terms. The superiority 
of Type B over Type IV is evident. The results for Type IV are taken from the 
class notes of Professor C. C. Craig. 

Type V , Table 4. Using the adjusted moments we have a comparison among 
Types V, A, and B. While the graduations may seem satisfactory, the x 2 test 
shows that the fit is poor in each case. The order of merit is Type V, Type B, 
and then Type A. The negative frequencies which appear in Type B may be 
due to the use of the adjusted moments. If we use the rough moments, the 
negative frequencies disappear. On the whole the fit by means of the adjusted 
moments is superior. 

Type VI. Table 5. Type VI using the adjusted moments gives an excellent 
fit. Even though a 3 is considerable, and ^2 = Ms approximately, four moments 
with Type B give a poor fit, and five moments, adjusted, achieve a very small 
gain. Five moments using the unadjusted moments give some improvement, 
but the — 2 frequency in the first class is objectionable. 

Normal Curve. Table 6. The normal curve provides a fine fit. P{x > .9) = 
.96, n = 6. The first two and the last two classes were grouped together for the 
test. The fit by Type B is less probable, P(x 2 > 8) = .15, n— 5. Type B has 
two discrepancies, the negative frequencies, and the fact that the total fre¬ 
quencies' (neglecting the —1) is 352. That Type B does so well is in itself 
quite amazing! 
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TABLE 1 


X 

Actual frequency 

Frequency computed 
by Pearson Type I 

Frequency given 
by Type B 

0 

34 

44 

42.4 

i 

145 

137 

121.3 

2 

156 

149 

168.7 

3 

145 

142 

156.8 

4 

123 

127 

120.5 

5 

103 

108 

94.9 

6 

86 

88 

82.9 

7 

71 

69 

72.2 

8 

55 

51 

56.7 

9 

37 

36 

38.0 

10 

21 

24 

23.1 

11 

13 

14 

12.0 

12 

7 

7 

5.7 

13 

3 

3 

2.4 

14 

1 

1 

.9 

m = 4.175 

a 3 = .712247 

Type I P(x 2 > 

4.36) = .88 

Mi = 7.66237 

on = 2.95214 

n (number of degrees of 

Ms « 15.1069 

c 2 = 1.74368 

•freedom) 

= 9 

M4 = 173.326 

c 3 = - .078298 

1 Type B P(x i >9.67) = .37 


ci = +. 094592 

> 

n = 9 


F(x) = \f/(x) +1.74368 Aty(a;) - .078298 AV(*) + .094592 AV(*). 



m = 1.33466 = -^- = 2.0833 a = .05356 

3/2 

M: = 1.44179 » c 3 = -.32510 

Ms = 3.60662 


f(a;) = f(s) + . 053564Y(z) ~ .325104^0) 
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TABLE 3 


Number of alpha particles from a bar of polonium in intervals of f of one minute 


X 

Frequency 

Type IV 

Type B 

2 terms 

TypeB 

3 terms 

TypeB 

4 terms 

0 

57 

50 

49.5 


58.2 

l 

203 

183 



199.8 

2 

383 

392 



386.1 

3 

525 

544 

532.3 

533.8 

523.9 

4 

532 




532.1 

5 

408 

417 



418.2 

6 

273 


254.8 

254.4 

260.2 

7 

139 

131 

137.1 


134.0 

8 

45 




56.7 

9 

27 

26 

26.1 


22.9 

10 

10 



9.6 

8.6 

11 

4 

4 


3.1 

3.6 

12 

0 

1 

.9 

.9 

1.6 

13 

1 

0 

.2 

.2 

.8 

14 

1 

0 



.3 


m = 3.87155 a 3 = .47844 

« = 3.69477 $4 *= 3.506536 

/i,= 3.39791 
iu = 47.86888 

F(») = +{x) - . 08839AV(^) - .00930AV(*) + .16810Aty(z). 

Type B,' 4 terms P(a; 2 > 4.50) = .72, n = 7 

Type IV P(z 2 > 10.8) = .15, n = 7 
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TABLE 4 


Mortality Among Female Nominees 


X 

Dea' *13 

Elderton 
Type V 

Type A 

TypeB 

2 terms 

TypeB 

3 terms 

Type B 

5 terms 

TypeB 

5 terms 

0 

4 

4 

2 

1.4 

-6.9 

-.4 

4.1 

1 

18 

10 

15 

26.3 

7.1 

9.4 

13.1 

2 

53 

80 

78 

109.7 

100.1 

84.6 

77.4 

3 

265 

261 

235 

248.3 

268.4 

252.3 

242.5 

4 

438 

441 

426 

379.5 

418.8 

425.9 

427.4 

5 

525 

480 

521 

432.7 

461.0 

484.0 

494.1 

6 

342 

381 

411 

388.8 

388.4 

402.6 

408.1 

7 

253 

247 

225 

285.4 

263.5 

259.0 

253.9 

8 

128 

137 

107 

170.8 

145.5 

132.2 

124.9 

9 

82 

68 

66 

84.3 

68.3 

58.6 

54.1 

10 

28 

32 

44 

32.9 

28.2 

26.2 

26.4 

11 

12 

14 

22 

8.6 

11.0 

13.9 

16.4 

12 

8 - 

6 

8 

-.01 

4.7 

8.2 

10.7 

13 

5 

3 

2 

-2.1 

2.1 

4.3 

5.9 

14 

1 

1 

0 

-1.5 

1.3 

2.0 

2.5 


Adjusted moments: 
m = 5.30435 = .703564 

m = 3.573345 <*4 = 3.996196 

M3 = +4.752437 

in = 51.02659 

M3 = 193.439125 


Rough moments: 
m = 5.30435 
Vi = 3.65668 
vt = 4.752437 
Vi = 52.85276 
Vi = 197.39949 


Type A: /(t) = <p{t) + .117261 /(i) + .041508/(0 
Type B: F(x) = f{x) - . 86550A V(^) - .77352AV(s) 

+ . 02814A V(Z + .57459AV(*) 

Using uncorrected moments 


Type B: F(x) = f(x) - .82384AV(*) - .73185AV(x) 

+ .03192Aty(x) + .94033A 6 ^(x) 
(last column above) 
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TABLE 5 


X 

Frequency 

Type VI 

TypeB 
i terras 

Type 1? 

5 terms 

0 

1 

1 

-9.5 

-2.0 

l 

56 

50 

83.2 

69.9 

2 

167 

168 

141.6 

143.1 

3 

98 

100 

102.3 

110.7 

4 

34 

36 

41.5 

40.2 

5 

9 

10 

8.7 

4.6 

6 

2 

2 

.05 

2.0 

7 

1 

.5 

-.4 

1.0 


Corrected moments: Rough moments: 
m m 2.402174 m = 2.402174 
Hi = . 928835 w =1.012169 
Hi = .893096 ms = .893096 

Hi = 4.088800 ^ = 4.313176 

Hi = 11.28304 
a 3 = .87704 
o 4 = 4.2101 

Type B, adjusted moments: 

F(x) = f(x) - . 73667A V(«) - .48516Aty(a:) - .06424Aty(a;) + .10365Aty(s) 
*Type B, rough moments: 

F(x) = f{x) - .69805Aty(x) - .44654AV(«) - .06587AV(x) + .15165Aty(*) 

* This is used in last column of above. There is a slight error here, which however will 
not affect the results materially. The third decimal place may be slightly wrong. 
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TABLE 6 


Normal curve 


X 

Frequency 

Normal curve 

Type F 

0 

.6 

.6 

2.3 

1 

2.8 

2.7 

4.7 

2 

11.5 

10.9 

8.7 

3 

27.7 

30.1 

25.2 

4 

59.1 

58.4 

55.2 

5 

84.7 

80.1 

79.5 

6 

74.1 

76.9 

80.1 

7 

50,5 

52.2 

58.1 

8 

23.2 

25.0 

29.7 

9 

12.2 

8.4 

8.6 

10 

1.3 

2.4 

-.9 


Moments corrected: 
m = 5.393443 
H - 2.769635 

H * .029805, ju » 22.40663 
as = .0064 
0-4 *» 2.920997 


TypeB: f(») * - 1.3119AV(&) - .4179AV(a) + 2.1625AV(a?) 

Colorado State College 
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A TEST OF A SAMPLE VARIANCE BASED ON BOTH TAIL ENDS OF 

THE DISTRIBUTION 

By John W. Fertig 

With the assistance oe Elizabeth A. Proehl 1 
(1) Introduction 

In testing the hypothesis, say Ha, that an observed sample E of size N has 
been drawn from a normal population for which the standard deviation, <r, has a 
particular value, cr 0 , one may form the ratio 

S (*, - mf/ol = .(I) 

*“1 do 

if the population mean m be known, or 

v' - S (xi -xY/d = ^ .(II) 

o-j 

where x is the sample mean, if the population mean be unknown. The proba¬ 
bility of obtaining a larger (or smaller) value of v or v' than that observed may 
readily be obtained from the appropriate tail area of the x 2 distribution with 
n = N or n = (N — 1) degrees of freedom respectively. The alternative 
hypotheses to Ha concerning the normal populations from which the sample 
may have been drawn assign different values to a and form a set of hypotheses, 
0. The members of SI may be classed according to whether they specify 
<r > <r 0 , or <r < tr 0 . The practice of regarding only one tail of the distribution, 
the upper or lower depending on whether v > N or v < N, is tantamount to 
accepting as admissible alternatives to H a only one of the classes of fl. 

The alternatives may sometimes be limited to one class or the other through 
some a priori knowledge, or the problem may be such that only one of the classes 
is relevant. However, since this is not generally the case, some method of 
considering all of the alternatives is needed. When testing hypotheses con¬ 
cerning the mean of the sampled population, the problem is quite simple, since 
the distribution of means is symmetrical. Thus, the “corresponding” value to 
any positive deviation, (x - m), is the negative deviation of the same magnitude. 
Merely doubling the tail area pertaining to either of the deviations will serve to 
take account of both classes of alternatives, i.e., those in which m > m and 
those in which m < m 0 . The problem is more difficult in the case of v or v\ 


1 From the Memorial Foundation for Neuro-Endocrine Research and the Research 
Service of the Worcester State Hospital, Worcester, Massachusetts. 
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■since the distribution is not symmetrical. In addition to the value of v or v f 
pertaining to the observed sample we require a “corresponding” value at the 
other end of the distribution. The definition of “corresponding” which is 
accepted will determine the required value. There may be a number of such 
definitions but not all of these will be equally acceptable. The value of v 
which delimits an equal tail area specifies one of the possible definitions of 
“corresponding.” Another definition would require that the ordinates at the 
two values of v be equal. 

The Neyman and Pearson Approach. Generalized procedures for. testing 
statistical hypotheses have been elaborated in recent years by J, Neyman and 
E. S. Pearson (1-5). These have considerable philosophical appeal and will be 
traced as a basis of solution of the immediate problem. A test of a hypothesis 
Ho consists essentially of a rule for rejecting Ho when the observed sample E 
falls within a suitable critical region w of the N-dimensioned sample space W, 
and of accepting Ho when E falls in (W — w). In testing any hypothesis two 
types of error may be made: 

i) Ho may be rejected when it is true; 

ii) Ho may be accepted when some alternative hypothesis, Hi , is true. 

Errors of the first kind may be considered “equivalent” since, if a true hypoth¬ 
esis is to be rejected, it is immaterial which one is chosen. Furthermore, the 
first type of error can be controlled through our choice of the size of w , say a . 
The size of w represents the probability of a sample E being an element of w 
when the hypothesis Ho is true. This probability may be designated briefly as 
P{E tw\Ho}' Then 

P{Etw\H»} = j ••• j p{E\H<>) dxxdxz «* * dx N — a .(Ill) 

where p(E | Ho) is the elementary probability law of the sample when Ho is 
true, i.e,, 

p(E | Ho) = p{x x , x 2 , • • • x N | Ho).(IV) 

Errors of the second type, however, are not equivalent, since their consequences 
depend on the difference of the true hypothesis from Ho . The utility of a test 
of Ho will depend largely on how it controls the second type of error. Ideally, 
the selection of a critical region should take into consideration the probabilities 
k priori of the hypotheses composing 12. Since these probabilities are generally 
unknown, tests may be sought which are valid independently of them. 

A distinction must be made between simple hypotheses which specify com¬ 
pletely the elementary probability law of the sample, p(E)> and composite hy¬ 
potheses which specify the law subject to one or more undetermined parameters. 

(2) Simple Hypothesis Concerning Population Variance 

A test based on a critical region may be called independent of the probabili¬ 
ties k priori of the alternative hypotheses if it is more powerful than any other 
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equivalent test for all of the alternative hypotheses (3). An equivalent test 
is one based on a region w x of the same size, a, i.e., 

P{E e Wo | Ho} = P{E e Wi | IIq) ~ a .(V) 

The power of a test based on any critical region, as w t , is the probability of its 
rejecting a hypothesis Ho when some other hypothesis Hi is true. That is, 
it is the probability of E falling in w x when Hi is true. Denote this power by 
P{E evJx | Hi}. The greater the power of a test, the smaller the risk of the 
second type of error. If tests as defined above exist, they minimize the proba¬ 
bility of the second type of error. Furthermore, the probability of the first 
type of error is no larger than a. Neyrnan and Pearson (2) have designated 
regions satisfying this definition as Best Critical Regions for testing Ho with 
regard to the set ft. If there is no such Best Critical Region, some compromise 
region must be chosen. 

A necessary and sufficient condition for w Q to be a Best Critical Region with 
regard to an alternative Hi is that within Wo 

piE | Ho) < kp(E | Hi) ...(VI) 

where k is some constant depending on a. If this inequality is true for any Hi , 
wo will be a Best Critical Region for the set ft. 

Neyrnan and Pearson (2) have shown that in testing the hypothesis that 
a- = <tq , when the population mean m is known, there are two Best Critical 
regions, one pertaining to the class of alternatives for which <r < tr 0 and defined 
by v < Vi , the other to the class a > <r 0 defined by v > z; 2 . v x and v% are values 
of v so chosen that the size of the critical region shall be a. Although there is 
no Best Critical Region for all of the alternatives, the choice of a compromise 
critical region should still depend on its control of the second source of error, 
that is, on its power for the various alternatives (4). Such a compromise 
region may be designated as a Good Critical Region. What is needed is a 
region w 0 of size a defined by the inequalities v < and v > v% . If V\ and i> 2 
are taken as the values cutting off equal tail areas, then the power of the test 
will be less than a for some values of a less than <r 0 . For those values of a, Ho 
would be accepted more frequently than if it were true. Thus a first require¬ 
ment for a Good Critical Region is that its power should nowhere be less than a, 
the value when Ho is true. Of all such unbiassed Critical Regions of size a, 
Wq should then be selected so that its power is everywhere greater than that of 
any other equivalent unbiassed region. 

Critical Regions sufficiently satisfying the above requirements can often be 
obtained by stipulating that the first derivative of the power function with 
respect to 8 , the parameter under consideration, shall be zero at 6 = do , and 
that the second shall be a maximum there. Then not only does .the probability 
of the second source of error decrease as we move away from 0o, but it decreases 
most rapidly in the vicinity of d 0 . Critical Regions satisfying these conditions 
are called unbiassed Critical Regions of Type A, (4). Under certain assumptions 
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concerning the nature of the elementary probability law p{E | d) it can be shown 
that tuo is defined by the inequalities <pi < C\ and cpi > c 2 where Ci and c 2 satisfy 
the conditions 



1 «* 

/ p(<Pu d<pi = 1 — « . 

Jc 1 

.(VII) 


j-C2 

/ V>iPW d<pi = 0 . 

Jci 

.(VIII) 

where 

d log p(E 1 6) 

dd 6^8 q 

.(IX) 


and p{(pi) is the distribution function of <p x . 

In applying these results to the testing of the hypothesis that <r = <tq when 
the population mean is known , 

<Px = (v- N) Ao. .(X) 


Obviously pfy), the distribution of v ) may be considered instead of pfa). Wq is 
defined by the inequalities v < V\ and v > where 


dv + / p{v) dv = a\ + a 2 = a 


rvi 

I p(v) < 

/ {v — N)p{v) dv = 

Jn 


v m e-' 12 


= 0 


..(XI) 

.(XII) 


Wq so defined is also of type rii, that is, its power curve lies everywhere 
above that of any other equivalent region, vanishing in the first derivative at 
<7 == cr 0 , (4), 

The use of Wq as the appropriate critical region is equivalent to the use of r 
as a test criterion, where 


= r y>. .. XIII ) 


That is, a value of v yielding the same r as the observed v may be taken as the 
corresponding value. Reference to the appropriate tables and summing of the 
two tail areas gives P T , the probability of obtaining a smaller value of r when 
Hq is true. Hq may be rejected if P T is less than some previously fixed number, 
say a. If the distribution of r could be evaluated the necessity of dealing with 
two values of v would be obviated. 

The criterion r is equivalent to that deduced by the use of maximum likelihood 
ratios (6). Thus, 


JV 

- 9 

p(E | & z ) = (27ro p2 )~ JV/2 e 


(XIV) 


2 The solution is the same in terms of a 2 r 
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Maximizing p(E | <r 2 ) for fixed E and all possible a 1 we have 

2W.CE | <r 2 ) = N* r ‘ 2 2t S (x< - m) 2 J ** e - "' 2 .(XV) 

.(XVI) 

.(XVII) 


_ gCSK) _ -kt~NI2 tf/2 

" Pm«.(S|ff 2 ) 

- N-* n f n r . 



The h th moment coefficient of X about zero, j 4(X), is given by 

+ k) 

2 


r 

MhOO - - 


L' 


r(N/2) 


(2e/N) M n (1 + h)‘ 


W(l+H/2 


. (XVIII) 







Probability that a sample has been drawn from a normal population with a specified variance or standard deviation 

Degrees of Freedom, n 
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For N infinite, (-21og«X) will be distributed as x 2 with one degree of freedom. 
For finite values of N, however, we have not been able to evaluate the dis¬ 
tribution of X, although the distribution of the Incomplete Beta Function serves 
as a good approximation. Approximate distributions for several values of N 
have been obtained. P *, the probability of obtaining a smaller value of X 
than that observed, as obtained from these distributions agrees well with the 
sum of the tail areas pertaining to V\ and V 2 yielding the same value of X (or r). 


The construction of tables is simplified by taking (1) 

logic X = IV/2(lo gl o e - k) .(XIX) 

That is, 

X - log, x = k log, 10.(XX) 


where x = v/N. Equation (XX) is independent of N and may be solved once 
and for all for x, given fc. 3 4 In Figure 1 is plotted the graph of equation (XX). 
For convenience, the branch of the curve giving the roots greater than unity 
has been folded back with altered scale from the minimum value of k, log 10 e, 
occurring at 2 = 1. Table I was then constructed by multiplying the two 
values of x for a given k by (N/2)\ referring to the Tables of the Incomplete 
Gamma Function (7) with p = (N — 2)/2, and adding the resulting two tail 
areas. The values for the odd numbers above 12 were obtained by interpolating 
between the even numbers. For N = 1, (a;)* was used as a normal deviate, 
The values in Table I should be correct to four decimals. Table I is entered 
with the number of degrees of freedom, n, on which x is based. In the case of the 
simple hypothesis this is N. 

The following may serve as an illustration: Blood urea nitrogen determinations 
(mg./lOO cc.) were made on a sample of 25 schizophrenic patients. The mean 
was found to be 15.56, the variance, 10.486. Previous investigation of blood 
urea nitrogen on a large sample of normal control subjects gave a mean of 16.03 
and a variance of 20.268, which for the purpose of the example may be considered 
as the population parameters. Then we may wish to test the hypothesis that 
the variance of the sampled population, cr 2 , is = 20.268, knowing the mean 
of the sampled population to be 16.03. Calculate 

* = = .528 

ol 

Referring to Fig. 1, the value of k is about ,505. Turning to Table I with 
k = .505/ n = 25, P is found to be .0457. We should thus be inclined to reject 
the hypothesis. 

For N small, the area of the tail of the distribution near zero is considerably 
larger than that at the upper end. As N increases the distribution of v becomes 


3 If the solution were explicit the distribution of X could easily be deduced from that of x. 

4 k obtained directly from (XX) is .507, corresponding to P = .0427. 
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more and more symmetrical and the two areas approach equality. Even for 
N = 50, however, they are rather unequal, so that merely doubling the area 
pertaining to the observed v does not give a sufficiently accurate approximation. 
For N > 50 an approximation correct within several units in the third decimal 
place may be obtained by taking \ / 2N(\/x — 1) as a normal deviate. This 
assumes that the standard deviation is normally distributed with variance <rl/2N, 

(3) Composite Hypothesis Concerning Population Variance 

Here H 0 specifies^nly the value of the parameter 6 = 0 O , leaving undetermined 
the value of a second parameter, v . Thus, Ho consists of a subset, w, of simple 
hypotheses, each of which specifies a different value for v. Any simple hypoth¬ 
esis specifying different values of both parameters, 6 and v> is an alternative 
to H q . These alternatives form the set ft. The elementary probability law 
determined by Hq is p(E \ Ho) = y(E | $ Q v), while that determined by an alterna¬ 
tive hypothesis Hi is p(E | Hi) = p(E | In testing composite hypotheses 
the first requirement is to find regions “similar” to W with regard to v, i.e., such 
that the chance of rejection of a true hypothesis, P{E e w | H 0 ), equals a for all 
the values of v specified by the simple hypotheses composing IIo . A test based 
on a similar region w 0 may be called independent of the probabilities k priori, 
if its power with respect to all the alternatives of ft is greater than that of any 
other similar region Wi of the same size, a, (3). Let 

= 9 log p(E | Ov)/dv\ 6 ^B 0 .(XXI) 

Then the equations p 2 = constant will describe hypersurfaces in iV-dimensioned 
space, on one of which the observed E must fall. Under certain assumptions 
pertaining to the law of elementary probability it can be shown (2) that a 
necessary and sufficient condition for w to be a similar region is that 

P{E 6 wfa) I Ho) = aP{E e WM | Ho}.(XXII) 

for all values of v> 2 , where w(<p 2 ) and W{vl) are parts of the surface <p 2 = constant 
common to w and W respectively. A similar region is’ then built up of these 
parts w((p 2 ) obtaining for the various values of <p 2 . The Best Critical Region, 
Wo , for a particular simple alternative, Hi , must then be composed of pieces, 
wofe), maximizing P{E | Hi}. The problem is the same as for simple 

hypotheses except that we shall be working in a space TF(<p 2 ) of (N — 1) dimen¬ 
sions. wq(<pI) is defined by the inequality 

p(E ] Hi) > fefe) p(E | Ho).(XXIII) 

where £(<#>) is some constant depending on a. If w 0 fe>) is the same for all Hi , 
then Wo is the Best Critical Region for testing H 0 with respect to ft. 

Neyman and Pearson showed (2) that in testing the composite hypothesis that 
o- = cr 0 when the population mean is unknown there are two Best Critical Regions 
corresponding to the class of alternatives a < ^ and a > <r Q , defined respectively 
by the inequalities v ' < v[ and v } > If the whole set of alternatives, ft, is to 
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be considered some compromise region must be sought. Dealing with the case 
where similar regions exist Neyman (5) defines a Critical Region as unbiassed 
and of Type B if the first derivative of the power function, P(E e w | Hi), with 
respect to 9 vanishes at 9 = 8o, and if the second derivative at that point is a 
maximum. Let 


<pi = 


d log p{E ] Ov ) 
38 


(XXIV) 


Then it can be shown that the desired region will be defined by the inequalities- 
(pi < and ipi > fo>(<£ 2 ) where /ci(<p 2 ) and ki(<pi) are determined to satisfy 


and 



(1 - odpiw) 


(XXV) 


J r *2 (^ 2 ) 

<pip((pi(Pi) d<pi = (1 - 


<*) J <pip(<pm) d<pi 


(XXVI) 


where p(<^) is the distribution function of 92 ) and p (^ 1 ^ 2 ) is the simultaneous 
distribution of (pi and <p 2 . 

Applying equations (XXV) and (XXVI) it follows that the appropriate 
Critical Region is defined by the inequalities v ' < v[ and v ( > u 2 where 


and 


a = OLi + Q!2 

4 



p(i/) dv 1 + 



(XXVII) 


v K N -m-w 


= 0 


(XXVIII) 


where p(ti') is the distribution function of v'. 

The use of the unbiassed Critical Region of Type B corresponds to adopting 
as a criterion 


v M~w-w = r , 


(XXIX) 


Since v’ derived from a sample of size N is distributed as v derived from a sample 
of size (N — 1 ), it follows that r' is equivalent to the r of equation (XIII) based 
on a sample of size (N — 1). Therefore Table I may also be used for testing 
the hypothesis that <r = o - 0 whatever be the population mean, by entering with 
the number of degrees of freedom, N — 1 . 

In the example previously used, compute 

x = 4 = 0.517 

From Figure 1, h is approximately .51, corresponding to P = ,0422. 
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?•' is not the same as the maximum likelihood ratio V (6). 

x , = Pm«(^|gow) _ pj~im v iNii e -w-N) = ^-n^'W. _, (XXX) 

proa X (£|cr 2 m) 

As A becomes infinite the distribution of V is the same as that of the X of (XVI). 
For N = 49, the probabilities corresponding to X' agree with those using r' to 
within a unit in the third decimal. 

The X' test is biassed as may be seen in Figure 2 where we have plotted the 
power of the test based on the region w defined by v[ = 3.187, tig = 22.912 for 
which a = .0436 + .0064 = .0500, on the assumption that at — 1.0, for N = 10. 
Although the criterion is biassed it is slightly more sensitive to alternatives 



Fig. 2. Comparison of Critical Regions for v'. Ho Specifies a\ = 1.0. N = 10, 

specifying g < a\ than is the unbiassed Critical Region of Type B defined by 
v{ = 2.953, v'i 20.305, a = .0339 + ,0161 = .Q500. The criterion of con¬ 
stant distribution, p(i/), 

^ c , .(XXXI) 

has also been considered. In this case v[ = 1.903, = 17.391, a = .0071 + 

.0429 = .0500. This criterion is biassed for some alternatives specifying 
<? < tro, but its power curve lies above that of the unbiassed region for a 1 > <rl. 

Apparently the bias may be shifted at will by changing the exponent of a'. 
This may be desirable if greater weight is to be given to one class of alternatives. 
In fact decreasing the exponent of v 1 to 0 produces the Best Critical Region 
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for the class of alternatives specifying a > <r\ f and defined by = 0 , v 2 = 16.919 
for a = .0500. No region can be found giving greater power. On the other 
hand this region is insensitive to alternatives of the other class. Increasing the 
exponent indefinitely produces the Best Critical Region for the other class 
defined by = 00 and v[ = 3.325 for a = .0500. 
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ON THE POLYNOMIALS RELATED TO THE DIFFERENTIAL EQUATION 

1 dy _ go + <hx N 
y dx bo + fax + fax 2 D 

By Frank S. Beale 


Introduction. In a previous issue of this Journal, 1 E. H. Hildebrandt has 
established the existence of a general system of polynomials P n (h, x) associated 
with the solutions of Pearson’s Differential Equation 


(R) 


i dy - K 

y dx D’ 


N and D being polynomials in x of degrees not exceeding one and two respectively 
with no factor in common. 

It was'shown that the polynomials P n (k, x) s P n themselves satisfy certain 
differential equations and a recurrence relation. The classical polynomials of 
Hermite, Legendre, Laguerre, and Jacobi are special types of P n (h, x). Since 
the classical polynomials are employed rather extensively in statistical theory, 
certain of their properties are of special interest. 

It is the purpose of this paper to determine from Hildebrandt’s general equa¬ 
tions some new properties of P„(fc, x) and to apply these properties to the 
classical polynomials. The paper consists of two parts. In part I some 
theorems are established concerning common zeros of D and P„ . In particular, 
a theorem is established to exhibit the conditions under which the zeros of P n , 
which are not zeros of D , are simple., In part II a method is outlined for the 
classical polynomials by which one can determine the number and location of 
the real zeros in the various segments into which the zeros of D divide the x axis. 
The points of inflexion and the degree of the polynomials are also considered. 

A new feature of the method employed is, we believe, its being based upon the 
use of differential equations of first order, for most part, while other investi¬ 
gators 2 have employed differential equations of second order. As to the results 
obtained, the author believes them to be partly new. They have points in 
common with the results of Fujiwara, Lawton and Webster, 

1 Systems of Polynomials Connected with the Charlier Expansions, etc., Annals of Math, 
Stat,, Vol. II, 1931, pp. 379-439. 

5 M. Fujiwara: On the zeros of Jacobi’s Polynomials, Japanese Journal of Math., Vol. 2,. 
1925, pp. 1, 2. 

. W. Lawton: On the zeros of Certain Polynomials Related to Jacobi and Laguerre Poly¬ 
nomials, Bull. Am. Math. Soc., Vol. 38,1932, pp. 442-449. 

M. S. Webster: Thesis, Univ. of Penna. These results were kindly communicated to 
me by Dr. Webster. 
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I. Theorems Concerning Common Zeros of P„(/c, x) and D 
The following equations will be employed later: 

(1) P n+l (K x) = [N + (k - n)D']P n (k, x) + DPn(k, x). 

(2) Pn+iik, x) = (n+ 1) N' + ‘AjzJt D"] P n (k, x). 

Pn+iQc, x) = [N + (k- n)D']PJk, x) 

(3) + n[tf' + 2 A ~n±} jy/J DPUk> x) _ 

These are not explicitly given in Hildebrandt’s Paper but the method of obtain¬ 
ing them is outlined there in detail. 

We shall make use of the following lemma which we state without proof. 
Lemma (1). Let P n (x) be a polynomial of degree n. If both P n and P' n contain a 
factor (x — a)” 1 , m < n, then P n contains the factor (x — a) nl+1 . 

We also need an expression for P^h(k } x). By repeatedly differentiating (2) 
and eliminating P„(&, #) we get, 

*) - fi (» +1 - i) far' + 2|! ~” + * d"1 p i), 

(4) "* L 2 J 

q =s 1, 2, • • • (n + 1) # 

Theorem h . If D is a perfect square , D' is not a factor of P n+i (fc, x) } n = 
0,1,2, 

Proof: Assume D' to be a factor of P n+i . From (1), D f is either a factor of 
P n or of N + (k — n) D'. But D f is not a factor of N + (fc — n) D f as this 
implies that D f is a factor of N contrary to hypothesis on (R) that D and N 
have no factor in common. Thus, D f is a factor of P n , and by a repetition of the 
reasoning a factor finally of Pi, which as it was just pointed out, is impossible. 

Theorem I 2 . Set D = ( a x x + ft) {a 2 x + ft), D not a perfect square. If 
aw + j3 i, i = 1 or 2, is a factor of P n , then (a& + Pi) q is a factor of P„ +Q _i, 
q = l, 2 , 3 , ... 

Proof: From (1), atf + P * being a factor of P n and D , is also a factor of 
P n +i. From (2), cux + (3i is a factor of Pl + i. From Lemma (1) it follows 
that (ctiX + ft) 2 is a factor of P n+X . Continued repetition of the reasoning 
establishes the theorem. 

Corollary. If both a x x + ft and a 2 x + ft are factors of P n , then D q is a factor 
of P n+Q—l • 

Theorem 1 3 . Assume D of the same form as in Theorem I 2 . If a iX + ft , 
i = 1 or 2, is a factor of P n +i and no higher power of a^x + Pi is such a factor then 
aiX + Pi is a factor of N + (k — n)D f . 

Proof: From (1), onx + ft being a factor of P n +1 and of D is also a factor of 
either N + (k — n)D' or of P„ . But a { x + ft a factor of P n requires, from I 2 , 
that (aiX + ft ) 2 be a factor of P n +1 contrary to hypothesis. Thus, arx + pi is a 
factor of N + (k — n)D'. 
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Corollary, If (a& + ft) (a 2 x + ft), (a x , a 2 ^ 0), is a factor of P «+i and no 
higher 'power of either aix + ft or a^x + ft is contained in P „+i then N (Jc — n) 
D' = 0. For from I 3 , N + (Jc - n)D' contains (atf + + ft) as a factor 

which implies N + (k — n)D’, being linear, vanishes identically. 

Theorem h. If + ft) s and no higher power of cox + ft is a factor of 
P n+q -1 then a# + ft and no higher power of a { x + ft is a factor of P„ . 

Proof: Let us write, 

(A) P„ +4 -i = (ctiX + ft) q </.„-!, g>n~i = a polynomial of degree < n — 1 which 
does not contain the factor ottx + /?,■. Taking the (q — l) sfc ' derivative of (A) 
by Leibnitz Theorem, we get, 


(B) 


P 


(9-1) 

n+fl-1 



(ffli x + ft) 4 


dr 


dx 5_1 “ 


:• 4>n—l • 


On setting q = q — 1 in (4) there results, 


(o pivA = ii fa+? -1 - *) 


N’ + 


2k 


q + i+ 2 


D” Pn- 


From (B) we see that a# + pi is a factor of Pn+i-i* No higher power of 
ctiX + ft is such a factor. From (C) our theorem now follows. 

Corollary ( 1 ). Under the hypotheses of Theorem 1 4 , ct{X + pi is a factor of 
N + (ft — n + l)P r < This follows at once from h and 7 3 . 

Corollary (2). If D q = (ai$ + ft ) 2 (a 2 £ + ft) 2 , (0:1, a 2 5* 0), is a factor of 
Ptt+2-1 no higher powers of either a x x + ft or a 2 x + ft are factors, then N + 
(Jc — n + 1 )D' = 0. For the linear expression iV + (ft — n + 1 )D* contains, 
from Corollary'(1), the quadratic factor (a x x + ft) (a 2 x + ft). 

The following lemma can be easily established and is given without proof. 

Lemma ( 2 ). Assume D of the same form as in Theorem I 2 . Then there is only 
one value of s for which N + sD f contains arx + ft as a factor . 

Theorem I 6 . Assume D of the same form as in Theorem I 2 , If N + (k — n)P' 
contains oux 4- ft, i = 1 or 2, as a factor , i/ion P rt+ i contains ot{X + ft and no 
/up/ier power 0 / a# + ft as a factor. 

Proof: From (1) we see that P n +i contains cax + ft at least to the first power 
as a factor. Again from (1), if P n +i contains a higher power of ctiX + ft as a 
factor, this means that both P n and P' n contain arx + pi at least to the first 
power as a factor and from Lemma (1) it follows that P n contains cax + Pi at 
least to the second power as a factor. By corollary (1) from Theorem / 4 it 
follows that arx + ft is a factor of N + (k — nf)D f for n x < n, contrary to Lemma 
(2). 

Theorem h . If ot x x + ft and a 2 x + ft are factors of N + (k — nt)D f and 
N +■ (ft — nf)L> } respectively , («i, a 2 ^ 0), then P^ = 0, y > n\ + n 2 . 

Proof: From Theorems 7 6 and J 2 we see that (a x x + ft)” 2 ( a 2 x + ft)” 1 , of 
degree n x + n 2 , is a factor of P na+n 1 , of degree n 2 + n x at most. Similarly, 
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fax + ft ) 712+1 fa x + ft)" l+1 , of degree m + % + 2, is a factor of P» 2+ftl+1 , of 
degree n 2 + n% + 1 at most. This implies P n2+ni +i s 0 . Hence, s= 0, 

/i > ni + n 2 . In fact, ( 1 ) shows that 0 implies P v s 0 , y > M . 

Theorem h . isw D of ffte sawe/om as m Theorem I 2 . Then P n4i = 0 , 
Pn ^ 0 , implies either N + (k - m)D r = 0 , m < n, or there exist two values of 
m, (mi, m 2 ), shc/j that N + (ft — mi)P', N + (ft — m 2 )P ; contain as factors 
a x x + ft and o^x + 02 respectively , (mi, m 2 < ft). 

Proof: Setting P n+l = 0 in ( 1 ) gives, 

( 1 °) [AT + (ft - n)Dq P n + PPl - 0 . 

If P n s const., 1° shows that iV + (k — %)ZF = 0 and our theorem is verified. 
Suppose Pn ^ const. We get from ( 1 °), 

p , _ [AT + ft - n)Zy]P n 

71 -5 * 

Thus, D is a factor of the numerator, and our theorem now follows from Corolla¬ 
ries (1) and (2) of Theorem h . 

Theorem J 8 . If N + (ft - m)D' ^ 0 , m = 1, 2 , * • • n, and if N + (it — m)D' 
contains neither a x x + 0 i, nor a^x + 0 2 as factors, then P n +1 and D toe no/actors 
in common. This follows at once from Theorems I 2 and I 4 which constitute a 
necessary and sufficient condition that P n and D have factors in common. 

Theorem U . If N s const, and if D is linear , aZt P n are constants, n = 1 , 2 , 3, 

• • • . This follows directly from ( 2 ). 

Theorem Z 10 . 7/ 2V 7 + ^ D" ^ 0, m = 1, 2, • • • (n — 1), aZZ zeros 0 / P* 
which are not zeros of D are simple , 

Proof: Suppose P n has a multiple zero x = a which is not a zero of D, Then 
(1) shows that a is a zero of P n4 i. From (2), a is a zero of Pn+i. From. 
Lemma (1), a is at least a double zero of P n + 1 . Furthermore, (3) shows that a 
being a double zero of P» and of P n+ 1 is also a double zero of P„_ 1 . By a con¬ 
tinued application of (3), it follows that a is a double zero of Pi which is impos¬ 
sible since Pi is of degree <1. 

II. Concerning the Zeros of P n {k, x) 

The polynomials P„(fc, x) are defined by Hildebrandt 3 as follows: P n {k, x) = 

1 n n 

i j) n ~ k — j) k y where y is a non-identically vanishing solution of the differential 
y dx n 

equation 

1 dy _ a 0 + a\X _ W 
y dx h+bix + bzx* D' 


3 L.c. pp. 400-401. 
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The Jacobi Polynomials are defined as follows: 

= *'-“(1 - xY ~*^ y +a ~\ 1 - «, l 3 


real. It follows that J n (x, a, ft) is a special type of P n {k, x) with N = {-IS-a) 
x + a, D = »(1 - x), n = k + 1, whence, 

N' = -ft-a, D' = 1 - 2x, D" = -2; D(0) = D{ 1) = 0, 


x = 


P^k, x) s N + W = 0 for 
“ + * -fi-C 


cl -J- j8 “f" 2k 


2k 


In determining the number and location of the real zeros of the Jacobi Poly¬ 
nomials we employ the following notations: 1 


Pi(k, x) = 0 for x = cci t k, 3 } i = lj 2, 






0 = IV' + ^1—Hd" = -ft - a - 2k + w, » = 1, 2, •. • k, 

Jt 

ju = [iV + (k — = a + (fc — ft), 

* « [tf + (* - w) 2>'Ui - -0 - (* - »). 

We proceed to determine the number of real zeros of the Jacobi Polynomials 
on the intervals (— <*>, 0), (0, 1), (1, *>) into which the zeros of D divide the 
x axis. 4 The proofs proceed by mathematical induction. We first determine 
the location of the real zeros of P n (k ) x), n = 1, 2, • • • k + 1, by successive 
applications of (1) and (2). We then use the relation Pk+t (fc, x ) = Jh +1 ($, a, /3). 

Several cases concerning possible values of a and (3 should be considered. In 
order to bring out the method of procedure only two such cases will be fully 
discussed here. The results for other possible cases will be merely listed. 

Ai : a < 0, ft < 0, [ a | < | 0 | } a, (3, a + (3 not integers . 

Let ki be the greatest integer contained in a, 

ft 7» « ff a a a u o 

■ 2 r j 

“ w h be the greatest integral value of k for which a + j3 + 2k <0. Then 


0 < h < h < fa . 


4 In the case a, @ > 0 these zeros all lie, as is known, oji (0, 1). 
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An : 0 < k < h . 
Then Jk+i(x, a, /3) has 


We then have 6 > 0, ju < 0, r > 0, 0 < < 1, P[ > 0. 

(1)* + (_!)* 

= - zeros m 0, 1. These are the only real zeros. 


Proof: Consider first Pi(fc, a:). Its only zero is at a JlW , where 0 < a liJU < 1 . 
Furthermore, P[ > 0. Also Pi > 0 for a: > a ukd and < 0 for x < ai,k,\. From 
( 1 ) we see that P 2 (k, 0 , (since P x (k, a lM ) = 0, D(a litil ) > 0 and P[ > 0 ). 

From (2) it follows that P' 2 (k, x) < 0 for x < a util , P^k, a, , u ) = 0, P((/c, x) > 0 
for x > ai,t,i • These conclusions follow from remarks concerning the sign of 6, 
the fact that Pi(/b, £* 1 , 4 , 1 ) = 0, and from remarks concerning the sign of Pi to the 
left and to the right of x = £* 1 , 4 , 1 . Thus, P 2 (fc, x) > 0 for all real x and hence 
has no real zeros. By employing ( 2 ), it is now evident that P$(k, x ) > 0 . From 
( 1 ) and remarks concerning n and v we see that P 3 (fc, 0) < 0 and P 3 (k, 1 ) > 0. 
Thus Pz{k, x) has a single real zero £* 3 , 4 , 1,0 < 0 : 3 , 4 ,! < 1. The reasoning from 
P 3 to P 4 is analogous to that from Pi to P 5 . By continuing this procedure we 
finally conclude that Pk+i(k, x), (= J k+ 1 (x, a, (3), has but one real zero, (in 0 , 1 ), 
if k is even and no real zeros if k is odd. 

Ai 2 : h < k < h. Set.k = fci + q, q = 1 , 2 , • • • , k 3 — k x . Here d > 0, 
H > 0 , n = 1 , 2 , • • • q — 1 , n < 0 , n = q, q + 1 , • * • , ? + h • v > 0 , £* 1 , 4,1 < 0 , 
P[(k, x) > 0. «/*! + ? + 1 ($, a, (3) has q distinct zeros in (— 0) and 

ny=i a. 

—--- — zeros in 0, 1 . These are the only real zeros. 

z 

Proof: First consider the sequence P n (k 7 x) n = 1,2, • • * g, since the conditions 
on 0 , pt, and v do not change over this range of n. Now P\(k 7 = 0 , a lt fc,i < 

0 . Furthermore since Pi > 0 we have Pi > 0 for x > <xi >kt i and < 0 for a: < 
ai.fc.i. Pass now to P 2 (fc, #)• Since D(ai tkl i) < 0 and P[ (k f ai tk ,i) > 0 , we see 
from ( 1 ) that P 2 (ft, ai f A;,i) < 0 , Moreover ( 2 ) shows P 2 (fc, ai.jt.i) = P 2 (k, x) 
< 0 fonx < ai.fc.i and > 0 for x > a lt k,i . Thus P 2 (k, x) < 0 and a relative 
minimum at x = Since | P 2 (&, ± «) | — *>, we see that P 2 (fc, x) has two 

real zeros of which the left most, a 2 ( fc,i, is in ( — », 0). Again ^ > 0 together 
with (1) assures P 2 (fc, 0) > 0, Thus a 2li t , 2 is in , 0 ), hence in (— *a, 0), 
By continuing this reasoning on the successive P n (fc, z), n = 1 , 2 , - • g, we 
conclude that P q (k> x) has q zeros in — oo, 0 and P q (k y a q ,k,\) < 0 , 

Next, consider the sequence P n (&, $), n * 2 + 1, ff + 2 , • • • q + fa + 1 . 
Over this range of n we have 6 > 0 , & < 0, v > 0 . From what has just been 
shown, P q (k } az,k t i) = 0 , — co < a q ,k,i < 0, % = 1, 2, * • • q. Also Pg{k, a q 
i = 1 , 2, • • • q f is alternately negative and positive. Suppose q odd, (similar 
reasoning holds for q even). Thus, we suppose P f q (k , ^ 2 ^,i) < 0, P Q {k } a q ,k iq ) < 
0 , P q (k ; x) > 0 for x < a Qikl i and < 0 for x > a q ,k , q . (1) shows P s +i(fc, oc qtk , x ) } 

i = 1, 2, • • • q ) to be alternately positive and negative. Thus, the zeros a Qi k t i 
are separated by g — 1 zeros of P e +i(fc, x). Since from ( 1 ), P 5 +i(fc, ag.fc.i) > 0 
and from ( 2 ) Pg + i(fc, x) > 0 fox x < a q ,\, there exists a zero a q + uk ,x in (- 
1 ), Thus far, we have established the existence of q zeros of P q +i(k, x) in 
(— 00 , 0 ). q being odd, we have from (1), P q +i(k } a q ,k , q ) > 0 . Also from ( 2 ), 
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Pi+i(fc, x) < 0 for x > a Q ,k,Q . Again from ( 1 ) and assumptions regarding n and 
v it follows that P q +i{k, 0 ) > 0 , P fl +i(fc, 1) < 0 . Thus, P q + i(fc, x) has a zero 

a e+ll * ig+1 in ( 0 , 1 ). There being no extrema for P g+ 1 (fc, x) other than the , 

i ~ 1 , 2 , • • • q } (as ( 2 ) shows), we have thus proved that P g+ i(fc, x) has q 
distinct zeros in (- qo, 0) and a single zero in (0, 1 ). Reasoning similarly from 
P fl+ 1 (fc, x) to P g+ 2 (fc, x) we establish the existence of q distinct zeros a ff+ 2 ,u, 
i 1 , 2 , • ■ ■ q, in (— op, 0 ) with a 5 + 2 ,ju in (- <*>, a^^i) and a ff+ 2 p M, i = 

2 , 3, ■ ■ ■ q, separating o; g +i,M > * — 1, 2 , • * • g. From ( 1 ) we see that P Q+ 2 (fc, 

a qHtkl g) < 0 and P g+ 2 (fc, <x Hltk , q +i) < 0 . The only extrema of P q + 2 (k, x) f 
(as ( 2 ) shows), are located at i =* T, 2, * ■ • q + 1. Again, by ( 2 ), 

P' + 2 (fc, x) < 0 for x > a 5 + i,jb, 5+ i ; hence there can be no real zeros of P 5+3 except 
the q zeros in (- *>, 0 ) already found. The reasoning from P c+2 to P 5+3 is 
similar to that from P c to . Thus, Pq+k ^i « Jk x +q+i has q distinct zeros in 
(— oo, OX together with one zero in (0, 1) for k x even. For k x odd, there are q 
distinct zeros in (— oo, 0) only. The results are the same whether q is odd or 
even. 

The results for the remaining sub-cases under case Ai are given in the table 
which follows. For completeness, the results for cases An and A 12 are included 
in the tabulation. A few words of explanation are necessary to clarify the 
conditions under which the various sub-eases in the table occur. Let | « | = 
fci + q, | f) | ~ fc* + h f h, q < 1 . If q + & < 1, then | a + /? | = fci + fc 2 and we 
have either, 

Axai: fci + hi even , 2fe 3 =* k\ + fc* m fc 3 — k x s= — h • 

Aj32 * fci -f* fcs odd) 2 fc 3 — fci “h fc 2 — 1 — fc 3 “ fci = fcs — kz *— 1 . 

Again if 1 < 3 + h < 2 , then | a + 0 | — fci + fc 2 + 1 and we have either, 

Aj33 1 fci ~b fc 2 ~h 1 even,) 2fc 3 = fci fc 2 ~p 1 ® fc 3 — fci = fc 2 —* kz -p- 1. 

Aj34: ki -p fci H" 1 ogM, 2 fc 3 = fci + fc 2 ^ fc 3 — fci = fc 2 fc 3 
In cases Am and A 16 i we assume \ a + — k l + k 2 + p } p<l ) while in cases 

Ana and A l62 , | a + & | = fci + h -f p, 1 < p < 2 . The complete results for 
case Ai follow. (See page 213.) 

A 2 : a < 0, 0 < 0, | a | < ) |, a, /3 not integers , a -f- fi — integer. Define k x , 

h , fa as in A x . Then 0 < k x < h < k 2 . In Case A 21 , + a is odd while in 

Case A 22 ,0 + a is even* (See page 214.) 

A 3 : a < 0 , £ < 0 , a - — kx , integer , £ noJ an integer , | a | < | /3 | . Define 

fci, fo, fe as in Ai. Then 0 < fci < < fc 2 i There are two sub-cases, A 31 : the 

greatest integral value of a + j3 is odd, A 32 : this integral value is even. (See 
page 215.) 

A 4 : a < 0, < 0 , a noi an integer , jS = -fc x , integer, | a | < | fi |. Define 

fci, fa , &3 as in Ai. Then 0 < fc x < fc 3 < fc 2 . There are two sub-cases, A 41 : 
the integral part of a + /3 is odd, A 42 : this integral value is even. (See page 216). 
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As : a < o, 0 < 0 ,| a | < | 0 |, a = 
h , fa , k$ as in Ai . In cases A 61 and A 


h integer, 0 = -fc, integer. Define 
521 “ + 0 is odd and even respectively. 


Caeca 

Polynomial 

Range of Sub-Script 

_______ Zeros in 





(— 00,0) 1 = 0 

(0, 1) 

Afillj A B 2i; 

Jk+It 

0 < k < fcx; 

°; 0; 

(D* + (-l) fc 

O ) 

Ab12, A622; 


Q ~ 0, 1; 2, • ‘ - Ic Jj 

9; h + 1; 

0 

AbeJ 

J Aa+g-fli 

3 = 1,2, •••, h- k 3 - 1; 

h-h- 3; ki + 1; 

0 

Ab23) 

J ^3+5+1 j 

? = L 2, •• •, hi —■ ki — lj 

fcj — fcl — 3 -j- 1; ki + 1 ; 

0 

Ab 14, A624,* 

J = 0j 

11 

h-L 

to 



A5I6 , Ab25J 

*ffcl+fc2+ff+ 1 “ 0j 

g = 1, 2, 3, • ■ • 




If assumptions are identical with those of A 6 except | a | . | £ | then for 
0 <k <h, the results agree with Am and /* 1+J+1 = o, g = 0,1, 2, 

A,: a > 0, 0 < 0,1 a \ > | 0 |, 0 not an integer. Let h be the largest integer 


Case 

Polynomial 

Range of Sub-Script 





(0, 1) 

Aei 

/ A+l 

0 <k <h 

0 

Afi2 


Q = lj 2, 3, «• ■ 

2 


Zeros in 

(i, °°) 


(D fc + (-D‘ 

2 

(l)* 1 + (-l) fcl 

2 


A 7 : Same assumptions as in A fi except ft = — Jb A , integer, 


Case 

Polynomial 

Range of Sub-Script 


Zeros in 





(0, 1) 

X = 1 ( 1 , 00) 

A 71 

J fc+i 

0 <,k<h 

- 1 

0 

0 (1)*+(-1)* 

2 

A 72 

J Ai+g+l 

q = 0, i, 2, 


3 

Jfci -f- 1 0 

A a : a 

V 

A 

0 , | a | = 1 01 

• Ji 

= a and results for J n , n > 1 are 


identical with those in and A e respectively according as j9 is or is not an integer. 
A 9 : a > 0 , 0 < 0 , | a | < | | ; ft a + / 3 , not integers, 

Let ki be the greatest integer in a + 0. 


(( ^ it u u a u p 

u ki “ “ “ “ for which a + /3 + 2k < 0. 
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Then 0 < h < h < fa . 


Case 

Polynomial 

Range of Sub-Script 

Zeros in 



(-»,o) 

(o, 1) 

a, «) 

AfllJ 


0 < k < hi k + 1; 

0; 

0 

A921; 

J fcs+ 9 +lj 

g «* 1,2, ■ fa even; h-q + 1 ; 

o; 

0 

Ag22j 

T i 3 +2+lJ 

g =» 1,2, •••, (fcs + D; fciodd; h — Q + 2; 

0; 

l 

Ags; 

/ Ai+«+lJ 

5 « 1, 2, •••, (*j - fa); 0; 

o; 

(D fci+fl -f (-1)^ 
2 

A94J 

J fea+tt+i> 

g = 1 , 2 , 3 , •••; 0 ; 

g; 

(l)* 2 + (~1)* 2 

2 


Aio: Same assumptions as in A $ but now | a | = | 0 | . Then ki = k B = 0, 
Ji = a, and results for J» , n > 1 are the same as in A fi3 and A 94. 

An : Same assumptions as in Ag except 0 = — fe , integer . 

Case Polynomial Range of Sub-Script Zeros in 

(- 00, 0 ) ( 0 , 1 ) x~l ( 1 , 00) 

An,i Same as Asi 
Au, 2 Same as A92 
Au,s Same as Ass 

Aii ( 4 Q 1 lj 2^ 3j • • • > 0 > ^ j fca T* 1 j 0 

A12: a > 0 , 0 < 0,^1 a | < | 0 | , 0 not an integer . a + /3 = odd integer . 
Define A?!, as in A®. 

Ais: Same assumptions as in An except a + 0 = even integer. 


Cases_Polynomial Range of Sub-Soript Zeros in 


Al2,l| Al8,l J 

Same as An 

(~°°I 0) 

A12,2 ; | 

*^*«+s+i) 2 — 1> 2, • 

[Jik t +» - const. > 0; 

' • , kz; fe - ? + 1 

Al8,3 ; ■ 1 

[«^tj+(+i> 2 = 2, • 

[«/»,+» = const. > 0; 

• -1 fe + 1J fa + 2 

Au.a, Aia.a; 

Same as Am 


Al2,4, Al3,4 ; 

Same as Am 
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Ai4 : Same assumptions as in A 12 , except 0 » -ft* integer . Cases A 14(1 , 
An ,2 and Ai4,3 bave the same results as Ai2 fl , Ai2,2, and Ai2,3 respectively. 
Ai4,4 has the same results as An >4. 

Ais : Same assumptions as An except 0 = ~k 2 , integer. Cases A ua , Ai 6|2 , 
and Ai6,3 have the same results as Aig,i, Ai3 ( 2, and A^ respectively. A^.4 has 
the same results as An,4. 

Aie :a=0, 0<O,/3 — not an integer. 

Let h be the largest integer contained in 0. 
u kz be the largest integer for which 0 + 2k < 0. 


Case 

Polynomial 

Range of Sub-Script 

Zeros in 





(— “1 0) i = 0 (0,1) 

(l, 00 ) 

Aia.i; 

Jk+i; 

0 < k < k 2 ; 

A; l; 0; 

0 

Al6,2i* 

Jk s+2+1; 

, „ • , , f h-q; 1; 0; 

q = 1, 2, ■■■, ki ~ k 3 ; < 

Us-g + l; l; 0; 

0; fci even 

1; ki odd 

Aie.a; 

Jk\ +3+1 > 

Q - L2, 3, 

0; 1; q-l; 

a.)** + (-D*> 
2 

A17 

: a = 0,0 

= — Ab — odd integer. 

Define h as in A 16 . 


Ai8 

: a — 0, 0 

= — fci — even integer. 

Define k 2 as in Aie . 





Cases 

Polynomial 

Range of Sub-Script 

Zeros in 





0 

it 

H 

/-s 

0 

s' 

A 

(0,1) 

X = 1 

A 17 , 1 , Ais.i; 

Same as Aie.i 





Ai 7 , 2 j 

J Aa+g+l \ ' 

q « 1, 2, 

k 2 q\ l ) 

o; 

0 

Al8,2; 

J *3+5+1J 

q — 1,2, • • fcs +■ 1; 

ks-q + 1 ; 1; 

o; 

0 

Al 7 , 3 i Ais.a! 

J/*1+1 — 0 



g-i; 

k>\ + 1 

\j *i+a+i ; 

5 = 1 , 2, 3 , • * •, 

0; l; 

A» : a 

= 0,0 = 0. 

Ji - 0. 




Jk+1 has k — 1 zeros in (0, 1), 1 zero at x = 

0, 1 zero at x = 

1 ,k = 

1 , 2, 3 , 


From the definition of J n {x, a, 0) it is readily seen that J n ( x, a, 0) ^ (-l) n 
J„( 1 — x, 0, a). Thus, a transformation of x to 1 — x interchanges a and 0. 
The interval (— 00 ? 0) is transformed into (X, <») and vice-versa. The points 
x = 0 and x = 1 are interchanged. Consequently, in all previous results we 
may interchange properly a and 0 . 

In the foregoing results, the only real multiple zeros that can occur are at 
either x — 0 or x = 1. In the process of determining the degree of multiplicity 
of such zeros use was made of Theorem I 2 . 

Points of Inflexion. By taking (4), setting k = n, and replacing N f and D n 
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by their values for Jacobi polynomials, we get: Pn+i(n f x) = (n + 1) ( n ) 
[0 + a + n] [0 + a + n + 1] £)■ From definitions of P n (k, x) and 

Jn(x, a, 0) we easily verify that, 

P n (n d= g, a:) « J n (x } a ± q + 1, 0 ± q + 1), whence, 

J n {x ) a, 0) s= (n + 1) (w) [0 + a + ft] [0 + a + ft + 1] i/ n -:i Or, 4* 2, 0 -f 2). 


We conclude that if neither a + 0 + n nor a + 0 + n + X vanishes, the points 
of inflexion of J n+ i(x, a, 0) are at the zeros of odd order of J n -i(x> « + 2, (3 + 2). 

The Degree of Jn(x, «, 0). In analyzing the results of cases Ai to A : o inclusive, 
it is noted that in some cases the number of real zeros of J n is less than n. The 
question naturally arises whether the degree of J n is n or less, for then we can 
determine the number of its imaginary zeros. The explicit expression of 
J n (x } a, 0) is known from which the degree of J n can be found for various a and 
0. However, the degree of J n can be found from (4). 

Since J n+ i(x } a, 0) = Pn+iOh let us replace h by ft in (4) and at the same 
time replace N f and D" by their values for Jacobi Polynomials. Thus, we get: 


Jnh(x, a, 0) = II (n + 1 - t)[-0 - « - n - i]P B -«+i(n, *), 

( 5 ) *“* 

n = 0,1, 2, ■ ■ •; q - 0,1, • • •, (n + 1). 


We may establish the following results. 

CO If ct + (3 is no,t an integer, the degree of J*+i (x, a, 0) is n -f 1, n = 0, 

In fact, in order for J«ji to vanish, we see from (5) that either some factor 
— 0 — a — n — i vanishes or P*_ ff+1 (n, x) vanishes identically. We first show 
that the latter is not possible. Now Pi(n, x) = N + nD ' = (— 0 — « — 2 n) 
•x + a + n ^0 since 0 + a is not an integer. Consequently, if P^(ft, x) s 0, 
p>0, p<n-f-l there will be a first value of p, (v = v), for which P,,(ftj x) ^ 0 
but P„_i(ft, x) ^ 0, By virtue of Theorem h this means that either N + 
(n — p)D' ^ [— 0 — a — 2(ft — p)] x + « + ft — p = 0, p < r, or else there 
exist two values of p, (pi, p 2 ), such that {— — o: — 2 (ft — pi)] a? + a + n — pi 
and [— 0 — a — 2(n - p 2 )] x + a + n — p 2 are divisible by x and 1 — x 
respectively, pi, P 2 < v — 1, pi ^ P 2 . Since, however, a + 0 is not an integer 
we see that, [— 0 — a — 2(n — p)] x + a + n — p ^ 0, n and p being integers. 
This eliminates the first possibility that P^n, x) = 0, p < n + 1. Again, if, 
[— /S — a: — 2(n — pO] x + n — pi is divisible by x, we have a + n — pi = 
0 or a an integer. For (a + n — p 2 ) — [0 + a + 2 (n — p 2 )] x ^ (a + 1 n — p 2 ) 

I" 1 — (g j " 71 P*) + (/^ + n — P2) to by 1 _ x requires 0 + n — 

L (a + n - pa) J 

P 2 = 0 or 0, an integer, a and 0 are therefore both integers contrary to hypoth¬ 
esis. Thus, in (5), no polynomial P„_ fl+1 (fc, x) = 0 and /« + i(x, a } 0) ^ 0. 
Replacing q by n + I in (5) leads to, 
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(6) J< "+^ 1) <*> a >® - ft (ft + 1 - i) [-/» - a - n - i] P„(», x) , 

n = 0, 1 , 2 , .... 

Thus Jfl+i 1 '. ^ 0, (since Po(n, a:) = 1 and no factor - 0 - a - n - f can vanish) 
and the degree of J n+l is precisely n + 1. From similar reasoning we prove: 
C 2 ) If a + jS > 0 the degree of J n+1 is n + 1, n = 0,1,2, ■ • • . 

C 3 ) If a + 0 = 0, then (I) J 1 = a and (II) J n+l is of degree n + 1, n = 1, 

2, 3, •■ • 

Ci) If a + 0 = -M - integer, M > 0, 0, a not integers, then, 

(I) For n < M, the degree of j B+l is min. (n + 1, M - n). 

(II) n = M, /„ + i s const. 

(Ill) n > M, the degree of J„ 41 is n + 1. 

Cs) If a + 0 = — M - integer, M > 0, a, 0 integers, a > 0, 0 < 0, then, 

(I) For n < M, the degree of J n+ 1 is min. (n + 1, M — n). 

(II) n = M, J n+ 1 = const. 

(Ill) n > M, the degree of J n +i is n + 1. 

C«) If a + j 8 = — M - integer, M > 0, a = - /^-integer, 0 = - ^-integer, 

ki < hi then, 

(I) For n < kt , J M1 is of degree n + 1. 

(II) n > k it J n+ 1 s 0. 

C 7 ) If a + 0 = — M — integer, Af > 0, a = 0 = — fci-integer, then, 

(I) For n < ki, /„ + i is of degree n + 1, 

(II) n > fci, J n+ i = 0. 

The Laguerre Polynomials. These are defined as follows: 

Ln * P n (a:, «) = ^ [<T* n - 0,1,2,. • ■; 

a — real. We see that L n is a special case of P„(fc, x) with N = — x -f, a, 
D == x, n — k + 1. It follows that 6 = —1, n = a + k — n, am = a + k, 
and P'i(k, x ) = 1., These can be used in determining the location of the real 
zeros of L n , as was done for J„ . The discussion here is somewhat simplified 
since L n has but one parameter, a, and the x-axis is divided by the zeros of D(x) 
into,two segments only, namely, (— 0) and (0, 00 ). 

The following results are easily obtained. 

Bi: a > 0, L n (x, a) has n distinct zeros in (0, «j), n = 1, 2, 3, • • • . This 
result is well known. 

B 2 \ a — 0. L„ +1 {x, a) has n distinct zeros in (0, °°) and a simple zero at x = 0, 
n = 0, 1, 2, • • • . 

B 3 : a < 0, a, not an integer. Let h be the largest integer contained in a. 

(I) L k+1 (x, a) has — zeros in (- 00 , 0), 0 < k < h, 
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, (!)*» -f (_l) fcl 

(II) Lkt+g+iix, a) has q distinct zeros in (0, «>) and - - - zeros in 

(- co, 0), 2 = 0,1, 2, ... . 

B 4 : a < 0, a = —h - integer. 

(I) L k +i(x, a) has — — zeros in (- », 0), 0 < k < fa. 

(II) Li 1+a+ i(x, a) has q distinct zeros in (0, “) and a zero of order ki + 1 at 
x *s 0, q = 0,1, 2, • • • . 

The Degree of L n (x, a). We show first that here Pn(n, x) 0, /i = 1, 2, •. • 
n + 1. By definition, Pi(n, x) s N + nD' = —x + a + n ^ 0. Let us 
rewrite (2) for our present situation thus: 

(2°) P'(n, x ) = -nP^-iin, x ). If, now, P^n, x ) = 0, then from (2 Q ) it follows 
that P„_i(n, x) s 0. Continuing this reasoning, we finally arrive at a contra- 
diction, namely, Pi(n, x ) s 0. If in (4) we set q = n + 1 and replace N 1 and D" 
by their values we get: 

a) = (-l) n+1 (n + 1)! Po(», *) - (—l) n+1 (n + 1)1 

Hence, is of degree n + 1. Note that this holds regardless of the value of 
a contrary to what was found for Jacobi Polynomials; 

Points of Inflexion. By a procedure analogous to that used for Jacobi Poly¬ 
nomials we can show that the points of inflexion of L n +i(x } a) are located.at the 
zeros of odd order of L n -i (x, a + 2). 

The Polynomials P n ( 0, x). If we set k = 0 in (1), (2), and (3) we obtain the 
following relationships for P n (0, %)* = P n (%) 35 Pn . 

(7) P n * i{x) = [N~ nD'} P n (x) + DPUx). 

(8) K»(x) = (n + 1) [iV' - I D"] P»(x). 

(9) P n+1 (x) - [N - nD 1 } P„(x) + D'j DP n ^(x). 

Theorems 1{ to 7i 0 inclusive, with k = 0, hold for P n (x). In addition, the 
following theorems hold for P n ■ 

Theorem Hi. Suppose N linear and D(x) > 0 for all x. Furthermore } let 
m 

N' — D" < 0 , w — 1, 2, 3, * «• . Then P n has n real , distinct zeros which 
separate the zeros of P n+1 . 

Proof: Denote the zeros of P n by «».,*, i - 1,2, * • • n, ot n .i < a n ,i+i . Suppose 
N f > 0. N being linear has a single zero an . Furthermore, since Pi = , 

then Pi < 0 for x < an and > 0 for x > an . We pass now to P 2 • From (7), 
we see that P 2 (an) > 0, (since jD > 0 and Pi > 0). Also (8) shows Pz(x) > 0 


6 E, H. Hildebrandt, Ioc, cit. pp. 399 . 
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for x 's an and <C 0 for x an , This follows from what was noted concerning 

the sign of P x for x > eta and x < « n , together with the hypothesis that N' - - 

2 

D" < 0. Thus, there exists a zero of P 2 in (- «, « u ) and a zero in (a u , «) 
and our theorem holds for n = 1. Assume that the theorem is true for n = h. 
The sequence Ph{oih,i), t = 1, 2, • • • h, is alternately positive and negative. 
Since, from (8), the only extrema of P h+1 are at a hlt , i = 1,2 ,■ • ■ h, we conclude 
that there are h — 1 zeros of P/ l+ i separating the , i = 1, 2, • ■ • h. Since 
Ph{uh,i) > 0 we conclude that P h < 0 for x < a h ,i . This fact, combined with 
(8), shows P'h+i(x) > 0 for x < a h ,i . Pa+i(“m) being positive, it follows that 
there exists a zero of P *+1 in (- <», a w ), Similar reasoning establishes the 
existence of a zero of P h +i in (a hlh , ■»). Our theorem is thus established for 
JV 7 > 0. The case N' < 0 can be similarly treated. 

Theorem Hz : If D(x) > 0 for all x, D" < 0, N' - ^ D" < 0, N' = 0, N ^ 0, 

then P n , n = 2, 3, • • • , has n — 1 real, distinct zeros which are separated by the 
zeros of P„_i. 

Proof: Since Px = N = const., we see from (7) that P 2 is linear. The reason¬ 
ing of Theorem Hi applies where we now start with P 2 . 

Theorem Hi: If D(x ) > 0 for all x, except x = j3, where D has a double zero and 

if N' ^ 0 , N' — - D" < 0, n = 1, 2, 3, • • • , then P n has n real, distinct zeros 

which separate those of P„ +1 . 

Proof: Theorem 7i with k = 0 assures us that P n and D have no zeros in 
common. The proof now follows the line of reasoning of Theorem Hi. 

Theorem Hi: If D(x) > 0 for all x except x = where D has a double zero and 

if N' = 0, N 0, N' — ^ D" < 0, m - 1, 2, 3, • • • , then P„ has n — 1 real, ■ 

distinct zeros which separate those of P n+ i 7 n = 1 , 2 , 3 , • • • . This theorem follows 
from Hz as did from Hi . 

Points of Inflexion. Setting ft = 0 in (4) leads to, 


p ,! 

* n-f-l 


{n + 



P n—X ■ 


This shows, under the assumptions of Theorems Hi to Hi inclusive, that the 
points of inflexion of P n +i are at the zeros of P n ~i • 

Hermite Polynomials . Theorem Hi and statement immediately above con¬ 
cerning points of inflexion apply directly to Hermite Polynomials where N = — x 
and D = cr 2 . 


Lehigh University. 



THE SIMULTANEOUS COMPUTATION OF GROUPS OF REGRESSION 
EQUATIONS AND ASSOCIATED MULTIPLE CORRELATION 
COEFFICIENTS 

I ■ , 

By Paul S: Dwyek 

1 . Introduction. The need sometimes arises for the prediction of a number of 
different variables from a given group of so-called fundamental variables. In 
the work of college prediction, for example, one might desire regression equations 
predicting certain measures of college achievement (e.g., first semester average, 
first semester English grade, first semester mathematics grade, number of hours 
of A received during first semester, etc.) on the basis of a number of other factors 
(e,g., high school record, score on American Council on Education Psychological 
Examination, score on some standard English achievement test, score on some 
standard mathematics achievement test, etc.). It is the purpose of this paper 
to show how the regression coefficients and the associated multiple correlation 
coefficients can be obtained simultaneously. The essence of the method is a 

, simple device by which one solution of general normal equations may be made to 
serve for all cases. 

2 . The normal equations. Let x h , x a , • • • x n , be the so-called funda¬ 
mental variables and let Xk be the predicted variable. The normal equations 
are computed by standard methods which result in one of the three types. 

Type I.' Normal equations for determining bo , hi, h, b s , • • . , . 

bon bjSaii -p b 2 2x j -f- bjSiCj 4*.4“ b n 2a;„ — = 0 

b$Xi + bi'Sxl *p biZxiXi -f bo2x\Xi 4 ". 4 " b„2xix n — 2 X\Xk = 0 

bo 2 Z 2 4 " biliXiXi 4 ” b^Zzl 4 “ bz'EXzXa 4 ■ • • •. 4 “ bnhXiXn — 'LXlXk = 0 


bo 2 a; n 4- b{SxnXi 4* biLx n x 2 -p bsiZxnXt 4"... -p b n 2x~ n — 2z„Xk = 0 

Type II. Normal equations for determining hi , 62 , b 3 , ■ • • , b„. 

Xf — Zj Mx{ 

bi^xl 4~ hXiiXi 4- b 3 2 xix 3 -p. 4 - b n lixix n — 2 f i Xk — 0 

b{2x 2 $i -P bi^xl -p bzZxtXz 4- — ..-p b n M 2 x n - 2 XoXk = 0 

b{Sx n xx 4- bi2x n Xi 4- bi2x n x 3 -P.. 4 - b n I,xl — 2 x n Xk = 0 
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Type III. Normal equations for determining ft, ft , ft , ... , ft . 

ft + n 2 ft + naft +.+ n n ft - ru = 0 

? 2 ift + ft + r 23 ft +.+ r 2n ft - r 2fc = 0 


r«ift + r n2 ft + r„ 3 ft +..+ r nn ft - r nfc = 0 

The three types are special cases of the general 

duVi + d l 2 y 2 + tWa +.+ ft#, +.+ d u y n — da = 0 

+ d 22 y 2 + d^ys + d 2 i y i +.+ ft n ?/» — ft* = 0 

ftl2/l + <^32^/2 + ^332/3 +• + ft/2/j +.+ ftnZ/n — ftfe = 0 

ftift + d i2 y 2 *+* di-32/3 +.+ d^yj +.+ di n y n — da = 0 


dnlVl + d n2 y 2 d-n^yz +.+ dn/2/y +.+ — dnk = 0 

where y,- are the regression coefficients and da = d*. 

The methods described in this paper are applicable to the general case and 
hence to each of the three particular types. 

In examining the normal equations, it is noticed that the first n terms of each 
equation are completely determined by the n fundamental variables. The 
equations, aside from the last terms, are identical no matter what variable is 
predicted. It is only necessary to devise a technique for separating the con¬ 
tributions of the da terms. 

3. Solution by determinants. One method utilizes determinants. The 
value yj is expressed in terms of a determinant involving a column with entries 
du , ftfc, d 3 k j ’ • • , dnk . The determinant is expanded in terms of this column. 

Specifically, let D be the determinant of the coefficients of the yj and let Da 
be the cofactor of any element da of D. Then 

n 

D = D%j dij 

»=i 

and 

yi = ^ (Dn dik H" D 21 ft* H - -Dai du "b • ♦ • • “b Dyi djk *b ■»•■~b D n i ft* •) 

2/2 =f j) (D ]2 dik + D 22 dik ,+ D 32 dik +« ■ • • -b D /2 djk + • * ■ * *b D n 2 d n k .) 


yi ss i (Djiftfc + D 2 <ft* + Dsidu + «••• + D/i djk +•■'•+ Dnidnk') 


y n = (Dm du + D 2ft ft* + Dgn ft* +••••+ Djn ft* +...■+ D nn d n*.) 
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It is only necessary to compute ~ to find the coefficient of djk in the expansion 
of Vt. 

An illustration is given. The normal equations are 

ft + .3300 pi + .2100 ft - ru = 0 

.3300 ft + fa - .4800 fa - ru = 0 

.2100 ft — .4800 ft 4- ft — Tik — 0 

from which at once 


ft = I (.7696 ru - .4308 n k - .3684r 3fc ) 
ft = (—.4308 ru + .9559 r» + .5493 r Sk ) 

ft - ^(-.3684rut + .5493r 2 * + .8911 r,*) 

and also 

jD = .550072 = (LOO) (.7696) + (,33)(-.4308) + (.21)(—.3684) 

= (.33) (—.4308) + ( 1.00) (.9559) + (-.48)(.5493) 
= (.21) ( — .3684) + (-.48)(.5493) + ( 1.00)(.8911) 

so that 

ft = 1.3991 Tik - .7832 r 2 * - .6697 r 3fc . 
ft — —.7832 Tik ~h 1.7378 r^ -J- .9986 . 


ft = —.6697 rik + .9986 fu + 1.6200 Tu . 

It is only necessary to insert any given values r ifc , r 2fc , r 3 t, to obtain the coeffi¬ 
cients of any specific regression equation. 


4. Solutions without determinants. Theoretically the solution by deter¬ 
minants is excellent but as the number of variables increases the work of com¬ 
puting the n cofactors or the ^ — different cofactors 1 becomes enormous. 


We desire a technique for separating the contributions of the last terms when 
determinants are not used. This can be accomplished by using a separate 
column for each da . Before algebraic manipulation; the value da is factored 
from the column and, after manipulative solution is complete, the multiplication 
by da. is carried out. 
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As an example consider the normal equations 

01 + ?’l202 — 7\k — 0 
Pi02i + 02 — r 2 k = 0 

where ?T 2 = ni = .3300. Then the normal equations may be represented by 
rows (1) and (2) of Table I. 


TABLE I 


Row 

Operation 

01 

02 

7*1* 

7*2* 

(i) 


1.0000 

.3300 

-1.0000 


(2) 


.3300 

1.0000 


-1.0000 

(3) 

— . 3300 times (2) 

- .1089 

- .3300 


.3300 

(4) 

(1) + (3) 

.8911 


-1.0000 

.3300 

(5) 

— (4) divided by .8911 

-l.OOOOi 


1.1222 

- .3703 

(6) 

- .3300 times (5) 

..3300 


- .3703 

.1222 

(7) 

~ (2) + (6) 


-1.0000 

- .3703 

1.1222 


The four decimal place solution, whose steps are indicated by (3) (4) (5) (6) (7), 
is from (5) and (7) 

0! = 1.1222 r lk - .3703 r 2k 
02 - -.3703 + 1.1222 r 2k 

This device may be combineu with most of the standard methods of solving 
normal equations. 

5. Combination with Doolittle method. Especially to be recommended is a 
combination of this device with the Doolittle method which is recognized as a 
most efficient method of solving normal equations in from five to ten variables 
[1] [2]. One of the advantages of the Doolittle method is that related multiple 
regression coefficients may be obtained from the same forward solution, though 
additional back solutions are necessary [3]. 

The problem which led to the development of this technique was the simul¬ 
taneous prediction of scores on various occupations covered by the Strong 
Vocational Interest Blank from the scores on a few fundamental occupations. 
A multiple factor analysis revealed that five basic factors account for most of the 
scores. Five occupational scores, serving as approximations to the five basic 
factors, were used as the fundamental variables and the other scores were 
predicted from them. 

As an illustration of this prediction technique combined with the Doolittle 
method, I have selected three test scores as fundamental since the solution based 
on them shows all the steps of the Doolittle method and is shorter than the five 
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variable problem. Actually, solution by determinants (section 3) is advised 
for problems involving three variables. The steps of the Doolittle solution are 
presented in Table II. The results should be compared with those of the 
determinant solution of section 3. 

The first column indicates the row md the second the description of the 
algebraic operation. The next three columns are the standard columns of a 
Doolittle presentation with the conventional elimination of the lower left entries. 
The next three columns carry through the Doolittle method with the values 
Tik f T 2 k, r 3 k kept in separate columns. The last column is an adaptation of the 
conventional summary check column of the Doolittle solution, 

TABLE II 


Generalized Doolittle Presentation 



Operation 

Pi 1 

02 

l>* 

rife 

T7fe 

TSfe 

S 

(i) 



.3300 

.2100 

-1.0000 




(2) 


.3300 

1.0000 

-.4800 





(3) 


.2100 

-.4800 

1.0000 





(4) 

Repeat (1) 


B 

.2100 

-1.0000 




(5) 

Negative of (4) 

11131 


-.2100 

1.0000 



Bg|j 

(6) 

Repeat (2) 



-.4800 





(7) 



-.1089 

—.0693 




-.1782 

(8) 

(6) + (7) 


.8911 

-.5493 

.3300 

-1.0000 


-.3282 

(9) 

— (8) divided by 



.6164 

-.3703 

1.1222 


.3683 


.8911 








(10) 

Repeat (3) 








(11) 

— .2100 times (4) 







-.1134 

(12) 

.6164 timeB (8) 



-.3386 

1 

- .6164 


—.2023 

(13) 

(10) + (11) + (12) 



.6173 

.4134 

-.6164 


- .5857 

(14) 

— (13) divided by 




-.6697 

.9985 

1,6200 

.9488 


.6173 








(15) 

.6164 times (14) 



-.6164 

-.4128 

.6155 

,9986 

.5848 

(16) 

(9) + (15) 


Egggg 


-.7831 

1.7377 

.9986 

.9531 

(17) 






—.2097 

-.3402 

-.1992 

(18) 



.3300 


.2584 

-.5734 

-,sm 

-.3145 

(19) 

(6) + (17) + (18) 





-.7831 

-.6697 

-1.0537 


The general solution is read from rows (19) (16) (14) and is 
A = 1-3990 r lh - .7831 r ik - .6697 n k . 
A = —.7831Xu + 1.7377.r 2i + .9986 r 3k . 
A = -.6697 r lk + .9985 r 2k + 1.6200 r 3k . 
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which agrees, aside from the last place, with the result of the solution by de¬ 
terminants. 

It is wise to check in the original equations (1), (2), (3) as soon as any ft is 
found. Row (14), for example, should be checked by showing 

(-.6697) (1.00) + (.9985) ( .33) + (1.6200) ( .21) = .0000 

(—,6697)( .33) + (.9985)( 1.00) + (1.6200)(-.48) = -.0001 

(—.6697)( .21) + (.9985)(—.48) + (1.6200)( 1.00) = 1.0001 

The same should be done with row (16) as soon as it is computed. Row (19) 
should be treated similarly. 

6. Many regression equations. If large numbers of regression equations are 
to be generated (the Strong Vocational Interest Study had 29 dependent va¬ 
riables), the following technique is suggested. Make a table with columns 
rik , ?’ 2 fc , etc. and use the rows to indicate the different values of k. On another 
slip of. paper insert the general values ft , ft , ft, • • • ft in successive rows so 
that a folding of the paper will bring any general /? expansion in conjunction 
with the r’s of any test, h. The scheme is illustrated in Table III. 


TABLE III 


No. 

Occupation 

nfe 


Tik 


ft* 

'/ft it 



r 

1 

Teacher 

1.00 

.33 

.21 


1.00 

.00 

.00 


1.00 

■ 2 

Physicist 

.33 

1.00 

-.48 


,00 

1.00 

.00 


LOO 

3 

Office Worker 

.21 

-.48 

* 1.00 


.00 

.00 

1.00 


1.00 

< 4 

Doctor 

.17 

.79 

-.52 


-.03 

.72 

-.17 


.81 

5 

Lawyer 

-.02 

.16 

-.59 


.24 

-.30 

-.78 


.64 

■' 6 

Engineer 

.16 

.78 

-.02 


1 

CO 

1,21 

.64 


.93 







t 






ft 


-.7831 

- .6697 



t 





ft 

-.7831 

1.7377 

.9986 




t 




ft 

-.6697 

.9986 





HU 



E9 

Mathematician 

.46 

.96 

-.49 


.19 

.82 

-.14 


.97 


etc. 











Thus, for the occupation of Engineer, 

ft = 1.3990 (.16) + (—,7831)(.78) + (-.6697)(-.02) = -.37 

ft = -.7831 (.16) + ( 1.7377) (.78) + ( .9996) (-.02) = 1.21 

ft = -.6697 (.16) + ( .9985)(.78) + ( 1.6200)(-.02) =■ .64 
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The value of the multiple correlation coefficient is then computed from the 
formula 

rfc,l23 -n = \/| Plk^ik + p2kT2k +**••+ ffnk^nk 

In the illustration above 

r fc< i 23 = V(-.37)(.16)+'(1.21)078) + (.64)(-.02) 

« .93 

7. Regression equations by deletion. The method of getting related regres¬ 
sion coefficients and correlation coefficients, described by Kurtz [3], is also 
applicable. Again, a problem involving more than three variables is needed to 
show the real value of the scheme but the technique may be illustrated in the 
three variable case. We wish to find, from the forward solution of Table II, 
the regression equation and the multiple correlation coefficient when the first two 
fundamental variables only are used. We delete all columns involving test 3 
and complete the back solution as indicated in Table IV, which may be viewed 
as a substitute for the last ten rows of Table II. 


TABLE IV 
(See Table II) 


Row 

Operation 


fi* 

fa 

njb , 


Tile 

S 

(20) 

(21) 

(22) 

Repeat (9) 

— .3300 times (20) 

(5) + (21) 

-1.0000 

-1,0000 

.3300 


-.3703 

.1222 

1.1222 

1.1222 

-.3703 

-.3703 




The results are 

ft = 1.1222 r u —.3703 r 2 * . 
ft = -.3703 r u + 1.1222 r^k . 
and these agree with the results of section 4. 

8. The simplified back solution. In every case in which the fts have been 
given in terms of r ’s the matrix of the coefficients is symmetric (sections 3, 4, 5, 7). 
One wonders if this symmetry is generally true and if it holds for normal equa¬ 
tions of Type I or Type II. 

Determinants are much more useful in establishing general properties, such 
as the one under discussion, than they are in computing the values of regression 
coefficients in the case of a problem involving many variables, r; We return to the 
determinant notation of section 3, 

In each of the three types, and hence in the general case da = dji so that D is a 
symmetric determinant, Da = D# and ^ Hence the matrix of the 

coefficients of the solution is symmetric. 
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This result may be used (1) to cheek the expanded results or (2) to eliminate 
some of the work of the back solution. The n coefficients must be recorded for 

after which the column indicated by r n h may be dropped. The first n - 1 
coefficients must- be computed for p n -i after which the column indicated by 
rn-i.k may be dropped, etc. The italicized entries in Table II are the ones 
which are eliminated in this way. The remaining coefficients are sufficient to 
completely determine the symmetric matrix. 

The summary right hand check column can not be readily used in the simpli¬ 
fied back solution but it is hardly to be ^recommended anyway, Kurtz [3] 
argues against it on the ground that it is not necessary. The essential check is 
to see that each /3 solution satisfies all of the original equations. 

9. Conclusion. This paper provides a technique for the computation of 
general regression equations and shows how the technique may be combined 
with the Doolittle method in providing a practical means of mass prediction. 

University of Michigan. 
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CONSTITUTION 


Article I 

NAME AND PURPOSE 

1 . This organization shall be known as the Institute of Mathematical Star 
tistics. 

2. Its object shall be to promote the interests of mathematical statistics. 

Article II 
MEMBERSHIP 

1 . The membership of the Institute shall consist of Members, Fellows, 
Honorary Members, and Sustaining Members. 

2 . Fellows shall be the only voting members of the Institute. 

Article III 

OFFICERS, BOARD OF DIRECTORS, COMMITTEE ON MEMBERSHIP, 
AND COMMITTEE ON PUBLICATIONS 

1 . The Officers of the Institute shall be a President, two Vice-Presidents, 
and a Secretary-Treasurer, elected for a term of one year by a majority ballot 
at the annual meeting of the Institute. Voting may be in person or by mail. 

(a) Exception. The first group of Officers shall be elected by a majority 
vote of the individuals present at the organization meeting, and shall serve until 
December 31,1936. 1 

2 . The Board of Directors of the Institute shall consist of the Officers and 
the previous President. 

3. The Institute shall have a Committee on Membership composed of three 
Fellows. At their first meeting subsequent to the adoption of this Constitution, 
the Board of Directors shall elect three members as Fellows to serve as the 
Committee on Membership, one member of the Committee for a term of one 
year, another for a term of two years, and another for a term of three years. 
Thereafter the Board of Directors shall elect from among the Fellows one 
member annually at their first meeting after their election for a term of three 
years. The president shall designate one of the Vice-Presidents as Chairman 
of this Committee. 

4. The Institute shall have a Committee on Publications composed of three 
Members or Fellows elected by the Board of Directors. The President shall 
designate a Vice-President as Ex Officio Chairman of this Committee. 
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Article IV 
MEETINGS 

1. A meeting for the presentation and discussion of papers, for the election of 
Officers, and for the transaction of other business of the Institute shall be held 
annually at such time as the Board of Directors may designate. Additional 
meetings may be called from time to time by the Board of Directors and shall be 
called at any time by the President upon written request from ten Fellows. 
Notice of the time and place of meeting shall be given to the membership by the 
Secretary-Treasurer at least thirty days prior to the date set for the meeting. 
AH meetings except executive sessions shall be open to the public. Only 
papers accepted by a Program Committee appointed by the President may be 
presented to the Institute. 

2. The Board of Directors shall hold a meeting immediately after their 
election and again immediately before the expiration of their term. Other 
meetings of the Board may be held from time to time at the call of the President 
or any two members of the Board. Notice of each meeting of the Board, other 
than the two regular meetings, together with a statement of the business to be 
brought before the meeting, must be given to the members of the Board by the 
Secretary-Treasurer at least five days prior to the date set therefor. . Should 
other business be passed upon, any member of the Board shall have the right to 
reopen the question at the next meeting. 

3. The Committee on Membership shall hold a meeting immediately after the 
annual meeting of the Institute. Further meetings of the Committee may be 
held from time to time at the call of the Chairman or any member of the Com¬ 
mittee provided notice of such call and the purpose of the meeting is given to 
the members of the Committee by the Secretary-Treasurer at least five days 
before the date set therefor. Should other business be passed upon, any 
member of the Committee shall have the right to reopen the question at the 
next meeting. 

4. At a regularly convened meeting of the Board of Directors, three members 
shall constitute a quorum. At a regularly convened meeting of the Committee 
on Membership, two members shall constitute a quorum. 

Article V 
PUBLICATIONS 

1. In the beginning, the “Annals of Mathematical Statistics” shall serve as 
the official journal for the Institute/ Other publications may be originated 
by the Board of Directors as occasion arises. 

Article VI 

EXPULSION OR SUSPENSION 

■ 1. Except for non-payment of dues, no one shall be expelled or suspended 
except by action of the Board of Directors with not more than one negative vote. 
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Article VII 
AMENDMENTS 

1. This constitution may be amended by an affirmative two-thirds vote at 
any regularly convened meeting of the Institute provided notice of such proposed 
amendment shall have been sent to each Fellow by the Secretary-Treasurer at 
least thirty days before the date of the meeting at which the proposal is to be 
acted upon, Voting may be in person or by mail. 

BY-LAWS 

Article I 

DUTIES OF THE OFFICERS, BOARD OF DIRECTORS, COMMITTEE. 
ON MEMBERSHIP, AND COMMITTEE ON PUBLICATIONS 

1. The President, or in his absence, one of the Vice-Presidents, or in the 
absence of the President and both Vice-Presidents, a Fellow selected by vote 
of the Fellows present, shall preside at the meetings of the Institute and of the 
Board of Directors. At meetings of the Institute, the presiding officer shall 
vote only in the case of a tie, but at meetings of the Board of Directors he may 
vote in all cases. At least three months before the date of the annual meeting, 
the President shall appoint a Nominating Committee of three members. It 
shall be the duty of the Nominating Committee to make nominations for 
Officers to be elected at the annual meeting and the Secretary-Treasurer shall 
notify all Fellows at least thirty days before the annual meeting. Additional 
nominations may be submitted in writing, if signed by at least ten Fellows of 
the Institute, up to the time of the meeting. 

2. The Secretary-Treasurer shall keep a full and accurate record of the 
proceedings at the meetings of the Institute and of the Board of Directors, 
send out calls for said meetings and, with the approval of the President and the 
Board, carry on the correspondence of the Institute. Subject to the direction 
of the Board, he shall have charge of the archives and other tangible and 
intangible property of the Institute. He shall send out calls for annual dues and 
acknowledge receipt of same; pay all bills approved by the President for expendi¬ 
tures authorized by the Board or the Institute; keep a detailed account of all 
receipts and expenditures, prepare a financial statement at the end of each year 
and present an abstract of the same at the annual meeting of the Institute after 
it has been audited by a Member or Fellow of the Institute appointed by the 
President as Auditor! The Auditor shall report to the President. 

3. The Board of Directors shall have charge of the funds and of the affairs 
of the Institute, with the exception of those affairs specifically assigned to the 
President or to the Committee on Membership. The Board shall have au¬ 
thority to fill all vacancies ad interim, occurring among the Officers, Board of 
Directors, or in any of the Committees. The Board may appoint such other 
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committees as may be required from time to time to carry on the affairs of the 
Institute. 

4. The Committee on Membership shall prepare and make available through 
the Secretary-Treasurer an announcement indicating the qualifications requisite 
for the different grades of membership. 

5, The Committee on Publications, under the general supervision of the 
Board of Directors, shall have charge of all matters connected with the publica¬ 
tions of the Institute, and of all books, pamphlets, manuscripts and other 
literary or scientific material collected by the Institute. Once a year this 
Committee shall cause to be printed in the Official Journal the Constitution 
and By-Laws and a classified list of all the Members and Fellows of the Institute. 

Article II 
DUES 

1. Members shall pay five dollars at the time of admission to membership 
and shall receive the full current volume of the Official Journal Thereafter, 
Members shall pay five dollars annual dues. The annual dues of Fellows shall 
be five dollars. The annual dues of Sustaining Members shall be fifty dollars. 
Honorary Members shall be exempt from all dues. 

2. Annual dues shall be payable on the first day of January of each year. 

3. The annual dues of a Fellow or Member include a subscription to the 
Official Journal. The animal dues of a Sustaining Member include two sub¬ 
scriptions to the Official Journal. 

4. It shall be the duty of the Secretary-Treasurer to notify by mail anyone 
whose dues may be six months in arrears, and to accompany such notice by a 
copy of this Article. If such person fail to pay such dues within three months 
from the date of mailing such notice, the Secretary-Treasurer shall report the 
delinquent one to the Board of Directors, by whom the person's name may be 
stricken from the rolls and all privileges of membership withdrawn. Such 
person may, however, be re-instated by the Board of Directors upon payment 
of the arrears of dues. 


Article III 


SALARIES 

1. The Institute shall not pay a salary to any Officer, Director, or member of 
any committee. 

Article IV 
AMENDMENTS 

1. These By-Laws may be amended in the same manner as the Constitution 
or by a majority vote at any regularly convened meeting of the Institute, if the 
proposed amendment has been previously approved by the Board of Directors. 
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